SlideShare a Scribd company logo
1 of 71
Session 5
ChengCheng Tan
Professional Machine Learning Engineer
Who am I?
ChengCheng Tan
- BA Linguistics & Computer Science, UCLA
- MS Computer Science, Stanford
- LLM + AI Safety
FAR AI Communications
AISafety.info + chatbot
- Google Women Techmakers Ambassador
Where are we on our
journey
Professional Machine Learning Certification
Learning Journey Organized by Google Developer Groups Surrey co hosting with GDG Seattle
Session 1
Feb 24, 2024
Virtual
Session 2
Mar 2, 2024
Virtual
Session 3
Mar 9, 2024
Virtual
Session 4
Mar 16, 2024
Virtual
Session 5
Mar 23, 2024
Virtual
Session 6
Apr 6, 2024
Virtual Review the
Professional ML
Engineer Exam
Guide
Review the
Professional ML
Engineer Sample
Questions
Go through:
Google Cloud
Platform Big Data
and Machine
Learning
Fundamentals
Hands On Lab
Practice:
Perform
Foundational Data,
ML, and AI Tasks in
Google Cloud
(Skill Badge) - 7hrs
Build and Deploy ML
Solutions on Vertex
AI
(Skill Badge) - 8hrs
Self
study
(and
potential
exam)
Lightning talk +
Kick-off & Machine
Learning Basics +
Q&A
Lightning talk +
GCP- Tensorflow &
Feature Engineering
+ Q&A
Lightning talk +
Enterprise Machine
Learning + Q&A
Production ML
Systems and
Computer Vision
with Google Cloud +
Q&A
Lightning talk + NLP
& Recommendation
Systems on GCP +
Q&A
Lightning talk + MOPs
& ML Pipelines on GCP
+ Q&A
Complete course:
Introduction to AI and
Machine Learning on
Google Cloud
Launching into
Machine Learning
Complete course:
TensorFlow on Google
Cloud
Feature
Engineering
Complete course:
Machine Learning in
the Enterprise
Hands On Lab
Practice:
Production Machine
Learning Systems
Computer Vision
Fundamentals with
Google Cloud
Complete course:
Natural Language
Processing on Google
Cloud
Recommendation
Systems on GCP
Complete course:
ML Ops - Getting
Started
ML Pipelines on Google
Cloud
Check Readiness:
Professional ML
Engineer Sample
Questions
1
2
4
5
3
LLM Overview
Intro to Gemini API
Sample Question Review
Q&A
Session 5 Content Review
LLM Overview
Natural Language
Processing [NLP]:
Computers Understand
Human Languages
Pre-1990s:
Rule-Based
Expert Systems
1990s-2000s:
Statistics &
Probabilities
bi-grams, tri-grams, n-grams
J.R. Firth, Linguist
You shall know a word by the
company it keeps
J.R. Firth, Linguist
2010s:
Rise of Deep Learning
and Neural Networks
Woman
Man
Queen
King
V(king) - V(queen) + V(woman) = V(man)
2013: Word2Vec Embeddings
Analogies Word Pair 1 Word Pair 2
Man-Woman king queen man woman
Capital city Athens Greece Oslo Norway
City-in-state Chicago Illinois Sacramento California
Opposite possibly impossibly ethical unethical
Comparative great greater tough tougher
Nationality adjective Switzerland Swiss Canada Canadian
Past tense walking walked swimming swam
Plural nouns mouse mice dollar dollars
2013: Word2Vec Embeddings
2010s:
Neural Networks
RNN, GRU, LSTM
Early Neural Networks
- Slow & Forgetful
2017: Transformers
- Self-Attention
- Algorithm+Data+Compute
Transformer
Architecture
Encoder + Decoder
Rise of LLMs
>1 Billion Neurons
Trained for
Next Word Prediction
Emergent Abilities
Emergent Abilities
Emergent Abilities
Pre-trained Base
Generalist
vs
Fine-tuned Models
Specialists
RLHF:
Reinforcement Learning
from Human Feedback
Fine-tuned
- Follow Instructions
- Conversations
Intro to Gemini API
Generalized Multimodal
Intelligence Network
Prototyping with
Google
AI Studio
goo.gle/ai-dev
Get API Key
Treat like password
Create new
- Freeform prompt
- Structured prompt
- Chat prompt
Freeform Prompt
Blog post creator
Write a prompt as text and
image for the model to
auto continue.
Structured Prompt
Marketing description
Table-based interface for
more complex model
priming and prompting
Chat Prompt
Talk to snowman
Simulate a back & forth
conversation with a model
Get Code
- Choose Language
- Open in Colab
- Copy to Editor
Settings
Tokens
- Words or subwords
- Different LLM tokenizers
- Training data, context window
Temperature
- Selected by probability
- Between 0 to 1.0
- Diversity or “creativity”
Settings
Model Sizes
Settings
Safety Ratings
Harm Categories
- Harassment
- Hate Speech
- Sexually Explicit
- Dangerous Content
Safety Ratings
Harm Categories
- Harassment
- Hate Speech
- Sexually Explicit
- Dangerous Content
Harm Probabilities
- NEGLIGIBLE
- LOW
- MEDIUM
- HIGH
Settings
The hottest new
programming language is
English.
Andrej Karpathy
OpenAI
Prompt Engineering
- Clear & Specific Instructions
- Give Examples
- Step by Step
REST APIs
Client libraries for
- Python
- JavaScript
- Android (Kotlin)
- Swift
- cURL
Setup
Install & import libraries
$ pip install google-generativeai
import google.generativeai as genai
genai.configure(api_key="<YOUR API KEY>")
Generate Text
Text only prompt
model = genai.GenerativeModel('gemini-pro')
response = model.generate_content("Write a story about a
boy and a backpack.")
print(response.text)
Generate Text
Text and image prompt
model = genai.GenerativeModel('gemini-pro-vision')
img = PIL.Image.open('image.jpg')
response = model.generate_content("Write a blog based on
this photo.", img)
print(response.text)
Chat Conversations
For interactive applications
model = genai.GenerativeModel('gemini-pro')
chat = model.start_chat(history=[])
response = chat.send_message("Hello, how are you?")
print(response.text)
Free for now.
Try it out!
goo.gle/ai-dev
Session 5 Content Review
Session 5
Study Group
ML Pipeline Automation & Orchestration
- Design pipeline.
- Implement training pipeline.
- Implement serving pipeline.
- Track and audit metadata.
- Use CI/CD to test and deploy models.
ML Solution Monitoring, Optimization, and
Maintenance
- Monitor ML solutions.
- Troubleshoot ML solutions.
- Tune performance of ML solutions for training & serving in
production.
Pipelines automate the training and deployment of models
Extract
Data
Prepare
Data
Train Model
Validate
Data
Evaluate
Model
Validate
Model
Deploy
Model
● Continuous Integration (CI) is no longer only about testing and
validating code and components, but also testing and validating data,
data schemas, and models.
● Continuous Deployment (CD) is no longer about a single software
package or a service, but a system (ML training pipeline) that should
automatically deploy another service (model prediction service).
● Continuous Training is a new property, specific to ML systems,
concerning automatically retraining and serving the models.
From DevOps to MLOps
In addition to the actual ML...
ML
Code
...you have to worry about so much more.
Configuration
Data Collection
Data
Verification
Feature Extraction
Process Management
Tools
Analysis Tools
Machine
Resource
Management
Serving
Infrastructure
Monitoring
Source: Sculley et al.: Hidden Technical Debt in Machine Learning Systems
ML
Code
Data
Ingestion
Data
Analysis + Validation
Data
Transformatio
n
Trainer
Model Evaluation
and Validation
Serving Logging
Shared Utilities for Garbage Collection, Data Access Controls
Pipeline Storage
Tuner
Shared Configuration Framework and Job Orchestration
Integrated Frontend for Job Management, Monitoring, Debugging, Data/Model/Evaluation Visualization
TFX is the solution to this problem
Data
Analysis + Validation
Data
Transformatio
n
Trainer
Model Evaluation
and Validation
Serving
Tuner
Data
Ingestion
Logging
Shared Utilities for Garbage Collection, Data Access Controls
Pipeline Storage
Shared Configuration Framework and Job Orchestration
Integrated Frontend for Job Management, Monitoring, Debugging, Data/Model/Evaluation Visualization
TFX is the solution to this problem
Tensorflow Data
Validation
TensorFlow
Transform
Estimator
or Keras
Model
TensorFlow
Model Analysis
TensorFlow
Serving
Keras
Tuner
TFX Components
Data
Ingestion
Logging
Tensorflow Data
Validation
TensorFlow
Transform
Estimator
or Keras
Model
TensorFlow
Model Analysis
Validation
Outcomes
TensorFlow
Serving
ExampleGen StatisticsGen
SchemaGen
Example Validator
Transform Trainer Evaluator Pusher
Model
Server
Data
Ingestion
Tensorflow Data
Validation
TensorFlow
Transform
Estimator
or Keras
Model
TensorFlow
Model Analysis
Validation
Outcomes
TensorFlow
Serving
Data
Ingestion
Data
Analysis + Validation
Data
Transformatio
n
Trainer
Model Evaluation
and Validation
Serving Logging
Shared Utilities for Garbage Collection, Data Access Controls
Pipeline Storage
Tuner
Shared Configuration Framework and Job Orchestration
Integrated Frontend for Job Management, Monitoring, Debugging, Data/Model/Evaluation Visualization
Estimator
or Keras
Model
TensorFlow
Model Analysis
TensorFlow
Serving
Keras
Tuner
Rolling
Update
Canary
Deployment
Blue-Green
Deployment
Deployment Strategies
Rolling Update
v1 v2
v1
v1
v2
v2
v2
- Slowly increasing
number of NEW
- Slowly decreasing
number of OLD
Canary Deployment
v1 v2
v1
v1
v2
v2
v2
- Test on small SUBSET
in production
- After testing,
traffic shifts to NEW
Blue-Green Deployment
v1 v2
v1
v2
- 2 SEPARATE
deployments
- After testing,
switch traffic from
OLD Blue to
NEW Green.
Sample Questions Review
Which of the following are benefit(s) of running TFX on Google Cloud? Select all that apply?
A. Vertex Pipelines is the only supported orchestrator for TFX pipelines
B. Simplify scaling of TFX pipeline data processing as your data grows
C. Automate your ML operational processes for individual and multiple ML pipelines.
D. Increase your pipeline development and experimentation velocity.
Which of the following are benefit(s) of running TFX on Google Cloud? Select all that apply?
A. Vertex Pipelines is the only supported orchestrator for TFX pipelines
B. Simplify scaling of TFX pipeline data processing as your data grows
C. Automate your ML operational processes for individual and multiple ML pipelines.
D. Increase your pipeline development and experimentation velocity.
In addition to CI/CD practiced by DevOps teams, MLOps introduces:
A. Continuous classification
B. Continuous regression
C. Continuous training
D. All of the above.
In addition to CI/CD practiced by DevOps teams, MLOps introduces:
A. Continuous classification
B. Continuous regression
C. Continuous training
D. All of the above.
In what order are the following phases executed in a machine learning project?
I - Selection of ML algorithm
II - Data Exploration
III - Definition of the business use case
IV - Model monitoring
V - Model operationalization
VI - Model Development
A. I, II, III, IV, V, VI
B. III, II, I, VI, V, IV
C. II, III, I, VI, IV, V
D. II, I, III, VI, IV, V
In what order are the following phases executed in a machine learning project?
I - Selection of ML algorithm
II - Data Exploration
III - Definition of the business use case
IV - Model monitoring
V - Model operationalization
VI - Model Development
A. I, II, III, IV, V, VI
B. III, II, I, VI, V, IV
C. II, III, I, VI, IV, V
D. II, I, III, VI, IV, V
III - Definition of the business use case
II - Data Exploration
I - Selection of ML algorithm
VI - Model Development
V - Model operationalization
IV - Model monitoring
Pipeline
You want to have two versions of your application in production, but be able to direct a
small percentage of traffic to the newer version as a gradual test. This is an example of
which deployment strategy?
A. Canary deployment
B. Blue-green deployment
C. Rolling updates
You want to have two versions of your application in production, but be able to direct a
small percentage of traffic to the newer version as a gradual test. This is an example of
which deployment strategy?
A. Canary deployment
B. Blue-green deployment
C. Rolling updates
Canary
Deployment
Blue-Green
Deployment
Rolling
Updates
Q&A

More Related Content

Similar to Certification Study Group - NLP & Recommendation Systems on GCP Session 5

data-science-pdf-16588.pdf
data-science-pdf-16588.pdfdata-science-pdf-16588.pdf
data-science-pdf-16588.pdfvkharish18
 
Innovation morning data analytics + ai
Innovation morning data analytics + ai Innovation morning data analytics + ai
Innovation morning data analytics + ai Claudia Angelelli
 
Build, train and deploy ML models at scale.pdf
Build, train and deploy ML models at scale.pdfBuild, train and deploy ML models at scale.pdf
Build, train and deploy ML models at scale.pdfAmazon Web Services
 
Build, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdfBuild, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdfAmazon Web Services
 
Generative AI in CSharp with Semantic Kernel.pptx
Generative AI in CSharp with Semantic Kernel.pptxGenerative AI in CSharp with Semantic Kernel.pptx
Generative AI in CSharp with Semantic Kernel.pptxAlon Fliess
 
Leverage the power of machine learning on windows
Leverage the power of machine learning on windowsLeverage the power of machine learning on windows
Leverage the power of machine learning on windowsJosé António Silva
 
Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)Julien SIMON
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLPaco Nathan
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningHoa Le
 
What’s New with Databricks Machine Learning
What’s New with Databricks Machine LearningWhat’s New with Databricks Machine Learning
What’s New with Databricks Machine LearningDatabricks
 
AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics Ruben Pertusa Lopez
 
Improving Machine Learning
 Workflows: Training, Packaging and Serving.
Improving  Machine Learning
 Workflows: Training, Packaging and Serving.Improving  Machine Learning
 Workflows: Training, Packaging and Serving.
Improving Machine Learning
 Workflows: Training, Packaging and Serving.Wilder Rodrigues
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...Gabriel Moreira
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...Gabriel Moreira
 
Leverage the power of machine learning on windows
Leverage the power of machine learning on windowsLeverage the power of machine learning on windows
Leverage the power of machine learning on windowsMia Chang
 
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & KubeflowMLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & KubeflowJan Kirenz
 
Serverless Machine Learning
Serverless Machine LearningServerless Machine Learning
Serverless Machine LearningAsavari Tayal
 
Data Agility—A Journey to Advanced Analytics and Machine Learning at Scale
Data Agility—A Journey to Advanced Analytics and Machine Learning at ScaleData Agility—A Journey to Advanced Analytics and Machine Learning at Scale
Data Agility—A Journey to Advanced Analytics and Machine Learning at ScaleDatabricks
 
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningPaige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningEdunomica
 

Similar to Certification Study Group - NLP & Recommendation Systems on GCP Session 5 (20)

data-science-pdf-16588.pdf
data-science-pdf-16588.pdfdata-science-pdf-16588.pdf
data-science-pdf-16588.pdf
 
Innovation morning data analytics + ai
Innovation morning data analytics + ai Innovation morning data analytics + ai
Innovation morning data analytics + ai
 
Build, train and deploy ML models at scale.pdf
Build, train and deploy ML models at scale.pdfBuild, train and deploy ML models at scale.pdf
Build, train and deploy ML models at scale.pdf
 
Build, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdfBuild, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdf
 
Generative AI in CSharp with Semantic Kernel.pptx
Generative AI in CSharp with Semantic Kernel.pptxGenerative AI in CSharp with Semantic Kernel.pptx
Generative AI in CSharp with Semantic Kernel.pptx
 
Leverage the power of machine learning on windows
Leverage the power of machine learning on windowsLeverage the power of machine learning on windows
Leverage the power of machine learning on windows
 
Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAML
 
DEVOPS AND MACHINE LEARNING
DEVOPS AND MACHINE LEARNINGDEVOPS AND MACHINE LEARNING
DEVOPS AND MACHINE LEARNING
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearning
 
What’s New with Databricks Machine Learning
What’s New with Databricks Machine LearningWhat’s New with Databricks Machine Learning
What’s New with Databricks Machine Learning
 
AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics
 
Improving Machine Learning
 Workflows: Training, Packaging and Serving.
Improving  Machine Learning
 Workflows: Training, Packaging and Serving.Improving  Machine Learning
 Workflows: Training, Packaging and Serving.
Improving Machine Learning
 Workflows: Training, Packaging and Serving.
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
 
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
PAPIs LATAM 2019 - Training and deploying ML models with Kubeflow and TensorF...
 
Leverage the power of machine learning on windows
Leverage the power of machine learning on windowsLeverage the power of machine learning on windows
Leverage the power of machine learning on windows
 
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & KubeflowMLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
 
Serverless Machine Learning
Serverless Machine LearningServerless Machine Learning
Serverless Machine Learning
 
Data Agility—A Journey to Advanced Analytics and Machine Learning at Scale
Data Agility—A Journey to Advanced Analytics and Machine Learning at ScaleData Agility—A Journey to Advanced Analytics and Machine Learning at Scale
Data Agility—A Journey to Advanced Analytics and Machine Learning at Scale
 
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine LearningPaige Roberts: Shortcut MLOps with In-Database Machine Learning
Paige Roberts: Shortcut MLOps with In-Database Machine Learning
 

More from gdgsurrey

MOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDCMOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDCgdgsurrey
 
Production ML Systems and Computer Vision with Google Cloud
Production ML Systems and Computer Vision with Google CloudProduction ML Systems and Computer Vision with Google Cloud
Production ML Systems and Computer Vision with Google Cloudgdgsurrey
 
Certification Study Group - Professional ML Engineer Session 3 (Machine Learn...
Certification Study Group - Professional ML Engineer Session 3 (Machine Learn...Certification Study Group - Professional ML Engineer Session 3 (Machine Learn...
Certification Study Group - Professional ML Engineer Session 3 (Machine Learn...gdgsurrey
 
Certification Study Group -Professional ML Engineer Session 2 (GCP-TensorFlow...
Certification Study Group -Professional ML Engineer Session 2 (GCP-TensorFlow...Certification Study Group -Professional ML Engineer Session 2 (GCP-TensorFlow...
Certification Study Group -Professional ML Engineer Session 2 (GCP-TensorFlow...gdgsurrey
 
2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptxgdgsurrey
 
Road to Google Developer Certification: Panel Discussion & Networking
Road to Google Developer Certification: Panel Discussion & NetworkingRoad to Google Developer Certification: Panel Discussion & Networking
Road to Google Developer Certification: Panel Discussion & Networkinggdgsurrey
 

More from gdgsurrey (6)

MOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDCMOPs & ML Pipelines on GCP - Session 6, RGDC
MOPs & ML Pipelines on GCP - Session 6, RGDC
 
Production ML Systems and Computer Vision with Google Cloud
Production ML Systems and Computer Vision with Google CloudProduction ML Systems and Computer Vision with Google Cloud
Production ML Systems and Computer Vision with Google Cloud
 
Certification Study Group - Professional ML Engineer Session 3 (Machine Learn...
Certification Study Group - Professional ML Engineer Session 3 (Machine Learn...Certification Study Group - Professional ML Engineer Session 3 (Machine Learn...
Certification Study Group - Professional ML Engineer Session 3 (Machine Learn...
 
Certification Study Group -Professional ML Engineer Session 2 (GCP-TensorFlow...
Certification Study Group -Professional ML Engineer Session 2 (GCP-TensorFlow...Certification Study Group -Professional ML Engineer Session 2 (GCP-TensorFlow...
Certification Study Group -Professional ML Engineer Session 2 (GCP-TensorFlow...
 
2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx2024-02-24_Session 1 - PMLE_UPDATED.pptx
2024-02-24_Session 1 - PMLE_UPDATED.pptx
 
Road to Google Developer Certification: Panel Discussion & Networking
Road to Google Developer Certification: Panel Discussion & NetworkingRoad to Google Developer Certification: Panel Discussion & Networking
Road to Google Developer Certification: Panel Discussion & Networking
 

Recently uploaded

Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 

Recently uploaded (20)

Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 

Certification Study Group - NLP & Recommendation Systems on GCP Session 5

  • 1. Session 5 ChengCheng Tan Professional Machine Learning Engineer
  • 2. Who am I? ChengCheng Tan - BA Linguistics & Computer Science, UCLA - MS Computer Science, Stanford - LLM + AI Safety FAR AI Communications AISafety.info + chatbot - Google Women Techmakers Ambassador
  • 3. Where are we on our journey
  • 4. Professional Machine Learning Certification Learning Journey Organized by Google Developer Groups Surrey co hosting with GDG Seattle Session 1 Feb 24, 2024 Virtual Session 2 Mar 2, 2024 Virtual Session 3 Mar 9, 2024 Virtual Session 4 Mar 16, 2024 Virtual Session 5 Mar 23, 2024 Virtual Session 6 Apr 6, 2024 Virtual Review the Professional ML Engineer Exam Guide Review the Professional ML Engineer Sample Questions Go through: Google Cloud Platform Big Data and Machine Learning Fundamentals Hands On Lab Practice: Perform Foundational Data, ML, and AI Tasks in Google Cloud (Skill Badge) - 7hrs Build and Deploy ML Solutions on Vertex AI (Skill Badge) - 8hrs Self study (and potential exam) Lightning talk + Kick-off & Machine Learning Basics + Q&A Lightning talk + GCP- Tensorflow & Feature Engineering + Q&A Lightning talk + Enterprise Machine Learning + Q&A Production ML Systems and Computer Vision with Google Cloud + Q&A Lightning talk + NLP & Recommendation Systems on GCP + Q&A Lightning talk + MOPs & ML Pipelines on GCP + Q&A Complete course: Introduction to AI and Machine Learning on Google Cloud Launching into Machine Learning Complete course: TensorFlow on Google Cloud Feature Engineering Complete course: Machine Learning in the Enterprise Hands On Lab Practice: Production Machine Learning Systems Computer Vision Fundamentals with Google Cloud Complete course: Natural Language Processing on Google Cloud Recommendation Systems on GCP Complete course: ML Ops - Getting Started ML Pipelines on Google Cloud Check Readiness: Professional ML Engineer Sample Questions
  • 5. 1 2 4 5 3 LLM Overview Intro to Gemini API Sample Question Review Q&A Session 5 Content Review
  • 7. Natural Language Processing [NLP]: Computers Understand Human Languages
  • 10. J.R. Firth, Linguist You shall know a word by the company it keeps J.R. Firth, Linguist
  • 11. 2010s: Rise of Deep Learning and Neural Networks
  • 12. Woman Man Queen King V(king) - V(queen) + V(woman) = V(man) 2013: Word2Vec Embeddings
  • 13. Analogies Word Pair 1 Word Pair 2 Man-Woman king queen man woman Capital city Athens Greece Oslo Norway City-in-state Chicago Illinois Sacramento California Opposite possibly impossibly ethical unethical Comparative great greater tough tougher Nationality adjective Switzerland Swiss Canada Canadian Past tense walking walked swimming swam Plural nouns mouse mice dollar dollars 2013: Word2Vec Embeddings
  • 15. Early Neural Networks - Slow & Forgetful
  • 16. 2017: Transformers - Self-Attention - Algorithm+Data+Compute
  • 18. Rise of LLMs >1 Billion Neurons
  • 19. Trained for Next Word Prediction
  • 24. RLHF: Reinforcement Learning from Human Feedback Fine-tuned - Follow Instructions - Conversations
  • 28. Get API Key Treat like password
  • 29. Create new - Freeform prompt - Structured prompt - Chat prompt
  • 30. Freeform Prompt Blog post creator Write a prompt as text and image for the model to auto continue.
  • 31. Structured Prompt Marketing description Table-based interface for more complex model priming and prompting
  • 32. Chat Prompt Talk to snowman Simulate a back & forth conversation with a model
  • 33. Get Code - Choose Language - Open in Colab - Copy to Editor
  • 34. Settings Tokens - Words or subwords - Different LLM tokenizers - Training data, context window Temperature - Selected by probability - Between 0 to 1.0 - Diversity or “creativity”
  • 36. Settings Safety Ratings Harm Categories - Harassment - Hate Speech - Sexually Explicit - Dangerous Content
  • 37. Safety Ratings Harm Categories - Harassment - Hate Speech - Sexually Explicit - Dangerous Content Harm Probabilities - NEGLIGIBLE - LOW - MEDIUM - HIGH Settings
  • 38. The hottest new programming language is English. Andrej Karpathy OpenAI
  • 39. Prompt Engineering - Clear & Specific Instructions - Give Examples - Step by Step
  • 40. REST APIs Client libraries for - Python - JavaScript - Android (Kotlin) - Swift - cURL
  • 41. Setup Install & import libraries $ pip install google-generativeai import google.generativeai as genai genai.configure(api_key="<YOUR API KEY>")
  • 42. Generate Text Text only prompt model = genai.GenerativeModel('gemini-pro') response = model.generate_content("Write a story about a boy and a backpack.") print(response.text)
  • 43. Generate Text Text and image prompt model = genai.GenerativeModel('gemini-pro-vision') img = PIL.Image.open('image.jpg') response = model.generate_content("Write a blog based on this photo.", img) print(response.text)
  • 44. Chat Conversations For interactive applications model = genai.GenerativeModel('gemini-pro') chat = model.start_chat(history=[]) response = chat.send_message("Hello, how are you?") print(response.text)
  • 45. Free for now. Try it out! goo.gle/ai-dev
  • 47. Session 5 Study Group ML Pipeline Automation & Orchestration - Design pipeline. - Implement training pipeline. - Implement serving pipeline. - Track and audit metadata. - Use CI/CD to test and deploy models. ML Solution Monitoring, Optimization, and Maintenance - Monitor ML solutions. - Troubleshoot ML solutions. - Tune performance of ML solutions for training & serving in production.
  • 48. Pipelines automate the training and deployment of models Extract Data Prepare Data Train Model Validate Data Evaluate Model Validate Model Deploy Model
  • 49. ● Continuous Integration (CI) is no longer only about testing and validating code and components, but also testing and validating data, data schemas, and models. ● Continuous Deployment (CD) is no longer about a single software package or a service, but a system (ML training pipeline) that should automatically deploy another service (model prediction service). ● Continuous Training is a new property, specific to ML systems, concerning automatically retraining and serving the models. From DevOps to MLOps
  • 50. In addition to the actual ML... ML Code
  • 51. ...you have to worry about so much more. Configuration Data Collection Data Verification Feature Extraction Process Management Tools Analysis Tools Machine Resource Management Serving Infrastructure Monitoring Source: Sculley et al.: Hidden Technical Debt in Machine Learning Systems ML Code
  • 52. Data Ingestion Data Analysis + Validation Data Transformatio n Trainer Model Evaluation and Validation Serving Logging Shared Utilities for Garbage Collection, Data Access Controls Pipeline Storage Tuner Shared Configuration Framework and Job Orchestration Integrated Frontend for Job Management, Monitoring, Debugging, Data/Model/Evaluation Visualization TFX is the solution to this problem
  • 53. Data Analysis + Validation Data Transformatio n Trainer Model Evaluation and Validation Serving Tuner Data Ingestion Logging Shared Utilities for Garbage Collection, Data Access Controls Pipeline Storage Shared Configuration Framework and Job Orchestration Integrated Frontend for Job Management, Monitoring, Debugging, Data/Model/Evaluation Visualization TFX is the solution to this problem Tensorflow Data Validation TensorFlow Transform Estimator or Keras Model TensorFlow Model Analysis TensorFlow Serving Keras Tuner
  • 54. TFX Components Data Ingestion Logging Tensorflow Data Validation TensorFlow Transform Estimator or Keras Model TensorFlow Model Analysis Validation Outcomes TensorFlow Serving ExampleGen StatisticsGen SchemaGen Example Validator Transform Trainer Evaluator Pusher Model Server Data Ingestion Tensorflow Data Validation TensorFlow Transform Estimator or Keras Model TensorFlow Model Analysis Validation Outcomes TensorFlow Serving
  • 55.
  • 56. Data Ingestion Data Analysis + Validation Data Transformatio n Trainer Model Evaluation and Validation Serving Logging Shared Utilities for Garbage Collection, Data Access Controls Pipeline Storage Tuner Shared Configuration Framework and Job Orchestration Integrated Frontend for Job Management, Monitoring, Debugging, Data/Model/Evaluation Visualization Estimator or Keras Model TensorFlow Model Analysis TensorFlow Serving Keras Tuner
  • 57.
  • 59. Rolling Update v1 v2 v1 v1 v2 v2 v2 - Slowly increasing number of NEW - Slowly decreasing number of OLD
  • 60. Canary Deployment v1 v2 v1 v1 v2 v2 v2 - Test on small SUBSET in production - After testing, traffic shifts to NEW
  • 61. Blue-Green Deployment v1 v2 v1 v2 - 2 SEPARATE deployments - After testing, switch traffic from OLD Blue to NEW Green.
  • 63. Which of the following are benefit(s) of running TFX on Google Cloud? Select all that apply? A. Vertex Pipelines is the only supported orchestrator for TFX pipelines B. Simplify scaling of TFX pipeline data processing as your data grows C. Automate your ML operational processes for individual and multiple ML pipelines. D. Increase your pipeline development and experimentation velocity.
  • 64. Which of the following are benefit(s) of running TFX on Google Cloud? Select all that apply? A. Vertex Pipelines is the only supported orchestrator for TFX pipelines B. Simplify scaling of TFX pipeline data processing as your data grows C. Automate your ML operational processes for individual and multiple ML pipelines. D. Increase your pipeline development and experimentation velocity.
  • 65. In addition to CI/CD practiced by DevOps teams, MLOps introduces: A. Continuous classification B. Continuous regression C. Continuous training D. All of the above.
  • 66. In addition to CI/CD practiced by DevOps teams, MLOps introduces: A. Continuous classification B. Continuous regression C. Continuous training D. All of the above.
  • 67. In what order are the following phases executed in a machine learning project? I - Selection of ML algorithm II - Data Exploration III - Definition of the business use case IV - Model monitoring V - Model operationalization VI - Model Development A. I, II, III, IV, V, VI B. III, II, I, VI, V, IV C. II, III, I, VI, IV, V D. II, I, III, VI, IV, V
  • 68. In what order are the following phases executed in a machine learning project? I - Selection of ML algorithm II - Data Exploration III - Definition of the business use case IV - Model monitoring V - Model operationalization VI - Model Development A. I, II, III, IV, V, VI B. III, II, I, VI, V, IV C. II, III, I, VI, IV, V D. II, I, III, VI, IV, V III - Definition of the business use case II - Data Exploration I - Selection of ML algorithm VI - Model Development V - Model operationalization IV - Model monitoring Pipeline
  • 69. You want to have two versions of your application in production, but be able to direct a small percentage of traffic to the newer version as a gradual test. This is an example of which deployment strategy? A. Canary deployment B. Blue-green deployment C. Rolling updates
  • 70. You want to have two versions of your application in production, but be able to direct a small percentage of traffic to the newer version as a gradual test. This is an example of which deployment strategy? A. Canary deployment B. Blue-green deployment C. Rolling updates Canary Deployment Blue-Green Deployment Rolling Updates
  • 71. Q&A