VISVESVARAYA TECHNOLOGICAL UNIVERSITY
"Jnana Sangama", Belgaum: 590 018
H.K.E Society’s
SIR M VISVESVARAYA COLLEGE OF ENGINEERING
(Affiliated to VTU - Belagavi, Approved by AICTE, Accredited by NAAC)
Yeramarus Camp, Raichur-584135, Karnataka
2023-2024
TECHNICAL SEMINAR PRESENTATION
ON
“MULTIMODAL AI”
UNDER THE GUIDANCE
OF
DR. SHARAN KUMAR
DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING
POWERING
THE NEXT
CHAPTER IN
GENERATIVE AI
MULTIMODAL AI
PRESENTED
BY
B CHANDANA
3SL20EC003
CONTENTS
• Introduction
• Literature survey
• Block diagram
• Applications
• Future scope
• Benefits and challenges
• Conclusion
• References
Introduction
• Multimodal AI is a form of artificial
intelligence that analyzes and interprets
multiple modes of data (such as text,
images, and audio) together, which can
allow it to generate more accurate
and human-like responses.
Literature survey
• The release of ChatGPT in November 2022, a conversation-focused
model that follows human instructions, further underscored the
feasibility of AGI in practical applications (Liu et al., 2023a). This
development has had a wide-ranging impact across various sectors,
including journalism (Liu et al., 2023c), education (Zhai, 2023; Liu
et al., 2023b), healthcare (Li et al., 2023; Liu et al., [n. d.]; Holmes
et al., 2023), industry (Dou et al., 2023), agriculture (Rezayi
et al., 2023), law (Bubeck et al., 2023), gaming (Bubeck et al., 2023),
and finance (Wu et al., 2023c), catalyzing a popular wave in AI (Liu
et al., 2023a, g, h).
• Rishi Bommasani, Drew A Hudson, Ehsan Adeli, Russ Altman, Simran
Arora, Sydney von Arx, Michael S Bernstein, Jeannette Bohg,
Antoine Bosselut, Emma Brunskill, et al. 2021. On the opportunities
and risks of foundation models. arXiv preprint
arXiv:2108.07258 (2021).
Sensory Inputs
Sensory inputs are the different forms of data that a multimodal AI system
processes, analogous to human senses such as vision, hearing, touch, and
smell.
Data Fusion
Data fusion combines information from multiple modalities, such as text,
images, and video, to improve the accuracy and robustness of AI
systems.
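As a minimal sketch of what combining modalities can look like in code, the example below shows two common fusion styles: feature-level (early) fusion, which concatenates per-modality feature vectors, and decision-level (late) fusion, which averages per-modality scores. The function names and the tiny feature vectors are hypothetical and chosen only for illustration; a real system would obtain its features from trained text and image encoders.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so no modality dominates by magnitude."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def early_fusion(*modality_features):
    """Feature-level fusion: normalize each modality, then concatenate."""
    fused = []
    for vec in modality_features:
        fused.extend(l2_normalize(vec))
    return fused


def late_fusion(scores, weights):
    """Decision-level fusion: weighted average of per-modality scores."""
    total = sum(weights)
    return sum(s * w for s, w in zip(scores, weights)) / total


# Hypothetical feature vectors (assumed, not produced by any real encoder).
text_features = [3.0, 4.0]
image_features = [0.0, 2.0, 0.0]

# One joint vector of length 2 + 3 = 5 that a downstream model could consume.
fused = early_fusion(text_features, image_features)
print(fused)

# Hypothetical per-modality confidence scores, weighted equally.
print(late_fusion([0.9, 0.5], [1.0, 1.0]))
```

Early fusion lets a single model learn interactions between modalities, while late fusion keeps each modality's model independent and is simpler to extend when a new sensor is added.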
Machine Learning Algorithms
Machine learning algorithms play a central role in multimodal AI by
analyzing and interpreting data from multiple sources such as text,
images, and audio.
Natural Language Processing
Natural Language Processing is a crucial component of Multimodal AI
technology, allowing for the analysis and understanding of human
language in combination with other modalities such as images or videos.
Computer Vision
Computer Vision is a key component of Multimodal AI technology, which
allows for the integration of visual data processing with other modes of
information to enhance overall system performance.
Applications
• Social media content moderation: Multimodal AI can be used to analyze text, images, and audio to
identify and moderate harmful content on social media platforms. For instance, it can detect hate
speech, violence, and bullying.
• Virtual assistants: Smart assistants like Google Assistant and Amazon Alexa are powered by
multimodal AI. They can understand and respond to natural language commands, both spoken and
typed.
• Healthcare imaging: In healthcare, multimodal AI can analyze medical images (X-rays, MRIs) along
with text reports and patient history data to improve diagnostics. This can lead to more accurate
diagnoses and better patient outcomes.
• Autonomous vehicles: Self-driving cars rely heavily on multimodal AI. They use a variety of sensors,
including cameras, radar, and LiDAR, to perceive their surroundings and navigate safely.
• E-commerce product recommendations: Many e-commerce websites use multimodal AI to
personalize product recommendations for customers. By considering both the product image and
description, the AI can recommend items that are more likely to interest the customer.
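The e-commerce case above can be sketched in a few lines: each product gets one vector built from an image embedding and a description embedding, and similar products are ranked by cosine similarity. This is an illustrative sketch only. The catalog, the two-dimensional embeddings, and the function names are all hypothetical, and a production recommender would use learned embeddings and customer behaviour as well.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def product_vector(image_emb, text_emb):
    """Combine the image and description embeddings by concatenation."""
    return list(image_emb) + list(text_emb)


def recommend(query_id, catalog, top_k=2):
    """Return the ids of the top_k products most similar to query_id."""
    query_vec = product_vector(*catalog[query_id])
    scored = [
        (cosine_similarity(query_vec, product_vector(*embs)), pid)
        for pid, embs in catalog.items()
        if pid != query_id
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [pid for _, pid in scored[:top_k]]


# Hypothetical catalog: product id -> (image embedding, description embedding).
catalog = {
    "red_sneaker": ([1.0, 0.0], [1.0, 0.0]),
    "blue_sneaker": ([0.8, 0.2], [1.0, 0.0]),
    "red_dress": ([1.0, 0.0], [0.0, 1.0]),
    "desk_lamp": ([0.0, 1.0], [0.0, 1.0]),
}

print(recommend("red_sneaker", catalog))
```

In this toy catalog the blue sneaker ranks first because it matches the query on both image and description, while the red dress matches on image only, which is the benefit of considering both modalities.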
Conclusion
• The future of AI is not just about seeing or hearing; it is
about understanding. Multimodal AI holds the
key to a new level of human-computer
interaction, with applications that can bridge
communication gaps, enhance our understanding of
the world, and help us solve complex
challenges in new ways. The potential for
positive impact spans many fields.
References
• Rania Abdelghani, Yen-Hsiang Wang, Xingdi Yuan, Tong Wang,
Pauline Lucas, Hélène Sauzéon, and Pierre-Yves Oudeyer.
2023. GPT-3-driven pedagogical agents for training children’s
curious question-asking skills. International Journal of Artificial
Intelligence in Education 167, 3 (2023), 102887.
• Hang Bao, Wen Wang, Li Dong, Qianru Liu, Ola K. Mohammed,
Kirti Aggarwal, and Fang Wei. 2022. Vlmo: Unified vision-language
pre-training with mixture-of-modality-experts. In Advances in
Neural Information Processing Systems (NeurIPS), Vol. 35. 32897–32912.
