SlideShare a Scribd company logo
1 of 18
Download to read offline
TRiPODの紹介
2021年5月21日
西村仁志
2
本日の紹介論文
 2021/4/8にarXivで発表された論文
 著者は有名な人が多い
 ソースコードは公開されていない
 データセットは公開されている
• Social Motion Forecasting (SoMoF) Benchmark
(http://somof.stanford.edu/)
• このデータセットを使ってICCV2021でワークショップが開催されるらしい
(1st Workshop, Benchmark and Challenge on Human Trajectory and Pose Dynamics Forecasting in the Wild)
3
 Forecasting human movements (pose dynamics and trajectory)
 Real-world applications
 including robotics
 healthcare
 detection of perilous behavioral patterns
 Extremely challenging in real-world scenes due to the different factors
1. Interactions between people in the scene
2. Objects involved in the scene can provide informative clues
3. Different levels of interactions
• (Movements of all the persons in the scene are not always highly correlated with each other nor
the humans to objects)
• These different levels of interactions can change over time
4. A person might move outside the sensor field-of-view or be a partially/fully occluded by an
object
1. Introduction
4
1. Introduction
5
 Existing methods
 Pose dynamics forecasting methods [14, 40, 41, 58]
 Trajectory forecasting [22, 27]
 Problem
 Neglect some of these challenging factors
• Do not effectively model all the informative environmental and social interactions in the scene
• Assume that all tracks and/or body joints are always observable in the past and future
 Proposed method
 Human pose dynamics and trajectory forecasting one step forward toward more practical
scenarios in-the-wild by considering all these factors together
1. Introduction
6
 Model
 Model the input skeleton body joints, the social human-human and human-object interactions
with different attention graphs
 These two types of information are different by nature
-> Applying an iterative message passing
 Humans may retain their influences on each other consistently in future
-> Preserve their spatio-temporal attentional relationships by modeling them also in future
prediction phase
 Address the concept of joint invisibility or body disappearance
 Accumulative error in sequential models for long-term sequences
-> Take a curriculum learning approach to train our model
 Dataset
 No proper benchmark dataset for such real-world problem
-> Introduce a new benchmark by repurposing existing datasets and introducing relevant
evaluation metrics
1. Introduction
7
 Pose dynamics forecasting
 Human trajectory predictions
 Pose dynamics and trajectory forecasting
2. Related Work
8
Our goal: To model the complex human-human and human-object interactions in a way
that can also predict all the joint visibility
 Problem Definition
3. Trajectory and Pose Dynamics Forecasting
A binary value being 0
if the joint is invisible
9
TRiPOD Model
10
A. Attentional Human Pose History
Leverage natural
connectivities
Influence of joints on each
other is not uniform
Encode the past skeleton
history for each person
11
B. Object and Global Scene Features
• visual feature
• geometrical information
• class label
Object detector
Spatio-temporal model to
represent the sequence
12
C. Human to Object Attention Module
Graph attention
-> Learn different levels of interactions
13
D. Social Attention Module
Graph attention
-> Learn different levels of interactions
14
E. Message Passing
𝑓𝑛
2
𝑚𝑛+1
𝑝
𝑓𝑛
1 𝑓𝑛
𝑝
Message to person 𝑝
at step n+1
𝑒𝑛
2
𝑒𝑛
1
Node feature
of person 𝑝
Same as above
Average
15
F. Future Social Interactions
Message
Passing result
𝑓𝑁
𝑝
Dynamically reconsider social
interactions in the future
16
 Problem in the training phase
 The model cannot recover from its accumulating errors at each time step
-> Feeding this error as the input to the next step propagates it throughout the network
 Solution
1. Make the final prediction to consider both the input and output of the RNN decoder at each
time step
2. Employ the concept of curriculum
• Starting with easier sub-tasks and gradually increasing the difficulty level of the tasks
• Divide our future pose prediction problem for 𝜏𝑓 frames into
𝜏𝑓
𝜔
frames
G. Training Strategies
17
5. Experiments ABC: Interpreting the interactions
between humans and objects
D: Being aware to
estimate occlusion
EF: Handling agent leaving the scene
18
 Feature work
 Incorporating 3D information (when camera parameters are available)
 Considering multi-modal future predictions
6. Conclusion

More Related Content

What's hot

Community detection in social networks[1]
Community detection in social networks[1]Community detection in social networks[1]
Community detection in social networks[1]sdnumaygmailcom
 
Complexity Explained: A brief intro to complex systems
Complexity Explained: A brief intro to complex systemsComplexity Explained: A brief intro to complex systems
Complexity Explained: A brief intro to complex systemsHiroki Sayama
 
Community detection algorithms
Community detection algorithmsCommunity detection algorithms
Community detection algorithmsAlireza Andalib
 
Wanted: a larger, different kind of box
Wanted: a larger, different kind of boxWanted: a larger, different kind of box
Wanted: a larger, different kind of boxLina Martinsson Achi
 
Adaptive network models of socio-cultural dynamics
Adaptive network models of socio-cultural dynamicsAdaptive network models of socio-cultural dynamics
Adaptive network models of socio-cultural dynamicsHiroki Sayama
 
ICPSR - Complex Systems Models in the Social Sciences - Lecture 2 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 2 - Professor...ICPSR - Complex Systems Models in the Social Sciences - Lecture 2 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 2 - Professor...Daniel Katz
 
Finding prominent features in communities in social networks using ontology
Finding prominent features in communities in social networks using ontologyFinding prominent features in communities in social networks using ontology
Finding prominent features in communities in social networks using ontologycsandit
 
DOMINANT FEATURES IDENTIFICATION FOR COVERT NODES IN 9/11 ATTACK USING THEIR ...
DOMINANT FEATURES IDENTIFICATION FOR COVERT NODES IN 9/11 ATTACK USING THEIR ...DOMINANT FEATURES IDENTIFICATION FOR COVERT NODES IN 9/11 ATTACK USING THEIR ...
DOMINANT FEATURES IDENTIFICATION FOR COVERT NODES IN 9/11 ATTACK USING THEIR ...IJNSA Journal
 
Taxonomy and survey of community
Taxonomy and survey of communityTaxonomy and survey of community
Taxonomy and survey of communityIJCSES Journal
 
Using e-Research Tools for Micro-Level Simulation
Using e-Research Tools for Micro-Level SimulationUsing e-Research Tools for Micro-Level Simulation
Using e-Research Tools for Micro-Level SimulationNeISSProject
 
GIS and Agent-based modeling: Part 1
GIS and Agent-based modeling: Part 1GIS and Agent-based modeling: Part 1
GIS and Agent-based modeling: Part 1crooksAndrew
 
A technical paper presentation on Evaluation of Deep Learning techniques in S...
A technical paper presentation on Evaluation of Deep Learning techniques in S...A technical paper presentation on Evaluation of Deep Learning techniques in S...
A technical paper presentation on Evaluation of Deep Learning techniques in S...VarshaR19
 
Community Detection with Networkx
Community Detection with NetworkxCommunity Detection with Networkx
Community Detection with NetworkxErika Fille Legara
 
AN GROUP BEHAVIOR MOBILITY MODEL FOR OPPORTUNISTIC NETWORKS
AN GROUP BEHAVIOR MOBILITY MODEL FOR OPPORTUNISTIC NETWORKS AN GROUP BEHAVIOR MOBILITY MODEL FOR OPPORTUNISTIC NETWORKS
AN GROUP BEHAVIOR MOBILITY MODEL FOR OPPORTUNISTIC NETWORKS csandit
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social MediaSymeon Papadopoulos
 
ICPSR - Complex Systems Models in the Social Sciences - Lecture 8 and 9 - Pro...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 8 and 9 - Pro...ICPSR - Complex Systems Models in the Social Sciences - Lecture 8 and 9 - Pro...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 8 and 9 - Pro...Daniel Katz
 
Recomendation system: Community Detection Based Recomendation System using Hy...
Recomendation system: Community Detection Based Recomendation System using Hy...Recomendation system: Community Detection Based Recomendation System using Hy...
Recomendation system: Community Detection Based Recomendation System using Hy...Rajul Kukreja
 

What's hot (20)

06 Community Detection
06 Community Detection06 Community Detection
06 Community Detection
 
Community detection in social networks[1]
Community detection in social networks[1]Community detection in social networks[1]
Community detection in social networks[1]
 
Complexity Explained: A brief intro to complex systems
Complexity Explained: A brief intro to complex systemsComplexity Explained: A brief intro to complex systems
Complexity Explained: A brief intro to complex systems
 
Community detection algorithms
Community detection algorithmsCommunity detection algorithms
Community detection algorithms
 
Wanted: a larger, different kind of box
Wanted: a larger, different kind of boxWanted: a larger, different kind of box
Wanted: a larger, different kind of box
 
Adaptive network models of socio-cultural dynamics
Adaptive network models of socio-cultural dynamicsAdaptive network models of socio-cultural dynamics
Adaptive network models of socio-cultural dynamics
 
ICPSR - Complex Systems Models in the Social Sciences - Lecture 2 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 2 - Professor...ICPSR - Complex Systems Models in the Social Sciences - Lecture 2 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 2 - Professor...
 
D1803022335
D1803022335D1803022335
D1803022335
 
Finding prominent features in communities in social networks using ontology
Finding prominent features in communities in social networks using ontologyFinding prominent features in communities in social networks using ontology
Finding prominent features in communities in social networks using ontology
 
DOMINANT FEATURES IDENTIFICATION FOR COVERT NODES IN 9/11 ATTACK USING THEIR ...
DOMINANT FEATURES IDENTIFICATION FOR COVERT NODES IN 9/11 ATTACK USING THEIR ...DOMINANT FEATURES IDENTIFICATION FOR COVERT NODES IN 9/11 ATTACK USING THEIR ...
DOMINANT FEATURES IDENTIFICATION FOR COVERT NODES IN 9/11 ATTACK USING THEIR ...
 
Taxonomy and survey of community
Taxonomy and survey of communityTaxonomy and survey of community
Taxonomy and survey of community
 
Using e-Research Tools for Micro-Level Simulation
Using e-Research Tools for Micro-Level SimulationUsing e-Research Tools for Micro-Level Simulation
Using e-Research Tools for Micro-Level Simulation
 
GIS and Agent-based modeling: Part 1
GIS and Agent-based modeling: Part 1GIS and Agent-based modeling: Part 1
GIS and Agent-based modeling: Part 1
 
Social Connectome
Social ConnectomeSocial Connectome
Social Connectome
 
A technical paper presentation on Evaluation of Deep Learning techniques in S...
A technical paper presentation on Evaluation of Deep Learning techniques in S...A technical paper presentation on Evaluation of Deep Learning techniques in S...
A technical paper presentation on Evaluation of Deep Learning techniques in S...
 
Community Detection with Networkx
Community Detection with NetworkxCommunity Detection with Networkx
Community Detection with Networkx
 
AN GROUP BEHAVIOR MOBILITY MODEL FOR OPPORTUNISTIC NETWORKS
AN GROUP BEHAVIOR MOBILITY MODEL FOR OPPORTUNISTIC NETWORKS AN GROUP BEHAVIOR MOBILITY MODEL FOR OPPORTUNISTIC NETWORKS
AN GROUP BEHAVIOR MOBILITY MODEL FOR OPPORTUNISTIC NETWORKS
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Media
 
ICPSR - Complex Systems Models in the Social Sciences - Lecture 8 and 9 - Pro...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 8 and 9 - Pro...ICPSR - Complex Systems Models in the Social Sciences - Lecture 8 and 9 - Pro...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 8 and 9 - Pro...
 
Recomendation system: Community Detection Based Recomendation System using Hy...
Recomendation system: Community Detection Based Recomendation System using Hy...Recomendation system: Community Detection Based Recomendation System using Hy...
Recomendation system: Community Detection Based Recomendation System using Hy...
 

Similar to TRiPOD: Modeling Human-Human and Human-Object Interactions for Pose Forecasting

A Case Study Of Dynamic Visualization And Problem Solving
A Case Study Of Dynamic Visualization And Problem SolvingA Case Study Of Dynamic Visualization And Problem Solving
A Case Study Of Dynamic Visualization And Problem SolvingYasmine Anino
 
Action Recognition using Nonnegative Action
Action Recognition using Nonnegative ActionAction Recognition using Nonnegative Action
Action Recognition using Nonnegative Actionsuthi
 
Towards the Design of Intelligible Object-based Applications for the Web of T...
Towards the Design of Intelligible Object-based Applications for the Web of T...Towards the Design of Intelligible Object-based Applications for the Web of T...
Towards the Design of Intelligible Object-based Applications for the Web of T...Pierrick Thébault
 
AERA 2022 Presentation
AERA 2022 PresentationAERA 2022 Presentation
AERA 2022 PresentationNguyenDao72
 
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra MalikDeep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra MalikThe Hive
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - Hiroshi Fukui
 
Assessing Complex Problem Solving In The Classroom Meeting Challenges And Op...
Assessing Complex Problem Solving In The Classroom  Meeting Challenges And Op...Assessing Complex Problem Solving In The Classroom  Meeting Challenges And Op...
Assessing Complex Problem Solving In The Classroom Meeting Challenges And Op...Emily Smith
 
Making Connections and Scheduling on the Route to School: The Smartphone enab...
Making Connections and Scheduling on the Route to School: The Smartphone enab...Making Connections and Scheduling on the Route to School: The Smartphone enab...
Making Connections and Scheduling on the Route to School: The Smartphone enab...Fraser McLeod
 
Simulation of Imperative Animatronics for Mobile Video Games
Simulation of Imperative Animatronics for Mobile Video GamesSimulation of Imperative Animatronics for Mobile Video Games
Simulation of Imperative Animatronics for Mobile Video GamesDR.P.S.JAGADEESH KUMAR
 
MULTI-LEVEL FEATURE FUSION BASED TRANSFER LEARNING FOR PERSON RE-IDENTIFICATION
MULTI-LEVEL FEATURE FUSION BASED TRANSFER LEARNING FOR PERSON RE-IDENTIFICATIONMULTI-LEVEL FEATURE FUSION BASED TRANSFER LEARNING FOR PERSON RE-IDENTIFICATION
MULTI-LEVEL FEATURE FUSION BASED TRANSFER LEARNING FOR PERSON RE-IDENTIFICATIONijaia
 
Temporal Reasoning Graph for Activity Recognition
Temporal Reasoning Graph for Activity RecognitionTemporal Reasoning Graph for Activity Recognition
Temporal Reasoning Graph for Activity RecognitionIRJET Journal
 
Description and Composition of Bio-Inspired Design Patterns: The Gradient Case
Description and Composition of Bio-Inspired Design Patterns: The Gradient CaseDescription and Composition of Bio-Inspired Design Patterns: The Gradient Case
Description and Composition of Bio-Inspired Design Patterns: The Gradient CaseFernandez-Marquez
 
Object Detection with Computer Vision
Object Detection with Computer VisionObject Detection with Computer Vision
Object Detection with Computer VisionIRJET Journal
 
Identifier of human emotions based on convolutional neural network for assist...
Identifier of human emotions based on convolutional neural network for assist...Identifier of human emotions based on convolutional neural network for assist...
Identifier of human emotions based on convolutional neural network for assist...TELKOMNIKA JOURNAL
 
CrossScenarioTransferPersonReidentification_finalManuscript
CrossScenarioTransferPersonReidentification_finalManuscriptCrossScenarioTransferPersonReidentification_finalManuscript
CrossScenarioTransferPersonReidentification_finalManuscriptXiaojuan (Kathleen) WANG
 

Similar to TRiPOD: Modeling Human-Human and Human-Object Interactions for Pose Forecasting (20)

Faoyan agus
Faoyan agusFaoyan agus
Faoyan agus
 
A Case Study Of Dynamic Visualization And Problem Solving
A Case Study Of Dynamic Visualization And Problem SolvingA Case Study Of Dynamic Visualization And Problem Solving
A Case Study Of Dynamic Visualization And Problem Solving
 
Metrics in virtual worlds
Metrics in virtual worldsMetrics in virtual worlds
Metrics in virtual worlds
 
Action Recognition using Nonnegative Action
Action Recognition using Nonnegative ActionAction Recognition using Nonnegative Action
Action Recognition using Nonnegative Action
 
Towards the Design of Intelligible Object-based Applications for the Web of T...
Towards the Design of Intelligible Object-based Applications for the Web of T...Towards the Design of Intelligible Object-based Applications for the Web of T...
Towards the Design of Intelligible Object-based Applications for the Web of T...
 
AERA 2022 Presentation
AERA 2022 PresentationAERA 2022 Presentation
AERA 2022 Presentation
 
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra MalikDeep Visual Understanding from Deep Learning by Prof. Jitendra Malik
Deep Visual Understanding from Deep Learning by Prof. Jitendra Malik
 
The Tower of Knowledge A Generic System Architecture
The Tower of Knowledge A Generic System ArchitectureThe Tower of Knowledge A Generic System Architecture
The Tower of Knowledge A Generic System Architecture
 
最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に - 最近の研究情勢についていくために - Deep Learningを中心に -
最近の研究情勢についていくために - Deep Learningを中心に -
 
Assessing Complex Problem Solving In The Classroom Meeting Challenges And Op...
Assessing Complex Problem Solving In The Classroom  Meeting Challenges And Op...Assessing Complex Problem Solving In The Classroom  Meeting Challenges And Op...
Assessing Complex Problem Solving In The Classroom Meeting Challenges And Op...
 
Making Connections and Scheduling on the Route to School: The Smartphone enab...
Making Connections and Scheduling on the Route to School: The Smartphone enab...Making Connections and Scheduling on the Route to School: The Smartphone enab...
Making Connections and Scheduling on the Route to School: The Smartphone enab...
 
Simulation of Imperative Animatronics for Mobile Video Games
Simulation of Imperative Animatronics for Mobile Video GamesSimulation of Imperative Animatronics for Mobile Video Games
Simulation of Imperative Animatronics for Mobile Video Games
 
Report
ReportReport
Report
 
MULTI-LEVEL FEATURE FUSION BASED TRANSFER LEARNING FOR PERSON RE-IDENTIFICATION
MULTI-LEVEL FEATURE FUSION BASED TRANSFER LEARNING FOR PERSON RE-IDENTIFICATIONMULTI-LEVEL FEATURE FUSION BASED TRANSFER LEARNING FOR PERSON RE-IDENTIFICATION
MULTI-LEVEL FEATURE FUSION BASED TRANSFER LEARNING FOR PERSON RE-IDENTIFICATION
 
The Foresight method for scenario building
The Foresight method for scenario buildingThe Foresight method for scenario building
The Foresight method for scenario building
 
Temporal Reasoning Graph for Activity Recognition
Temporal Reasoning Graph for Activity RecognitionTemporal Reasoning Graph for Activity Recognition
Temporal Reasoning Graph for Activity Recognition
 
Description and Composition of Bio-Inspired Design Patterns: The Gradient Case
Description and Composition of Bio-Inspired Design Patterns: The Gradient CaseDescription and Composition of Bio-Inspired Design Patterns: The Gradient Case
Description and Composition of Bio-Inspired Design Patterns: The Gradient Case
 
Object Detection with Computer Vision
Object Detection with Computer VisionObject Detection with Computer Vision
Object Detection with Computer Vision
 
Identifier of human emotions based on convolutional neural network for assist...
Identifier of human emotions based on convolutional neural network for assist...Identifier of human emotions based on convolutional neural network for assist...
Identifier of human emotions based on convolutional neural network for assist...
 
CrossScenarioTransferPersonReidentification_finalManuscript
CrossScenarioTransferPersonReidentification_finalManuscriptCrossScenarioTransferPersonReidentification_finalManuscript
CrossScenarioTransferPersonReidentification_finalManuscript
 

More from Hitoshi Nishimura

Tracking emerges by colorizing videosの紹介
Tracking emerges by colorizing videosの紹介Tracking emerges by colorizing videosの紹介
Tracking emerges by colorizing videosの紹介Hitoshi Nishimura
 
Online real time multiple spatiotemporal action localisation and predictionの紹介
Online real time multiple spatiotemporal action localisation and predictionの紹介Online real time multiple spatiotemporal action localisation and predictionの紹介
Online real time multiple spatiotemporal action localisation and predictionの紹介Hitoshi Nishimura
 
Learning to discover objects in rgb d images using correlation clusteringの紹介
Learning to discover objects in rgb d images using correlation clusteringの紹介Learning to discover objects in rgb d images using correlation clusteringの紹介
Learning to discover objects in rgb d images using correlation clusteringの紹介Hitoshi Nishimura
 
A bayesian approach to multimodal visual dictionary learningの紹介
A bayesian approach to multimodal visual dictionary learningの紹介A bayesian approach to multimodal visual dictionary learningの紹介
A bayesian approach to multimodal visual dictionary learningの紹介Hitoshi Nishimura
 
Sparse isotropic hashingの紹介
Sparse isotropic hashingの紹介Sparse isotropic hashingの紹介
Sparse isotropic hashingの紹介Hitoshi Nishimura
 
Dimensionality reduction with side information for image classification
Dimensionality reduction with side information for image classificationDimensionality reduction with side information for image classification
Dimensionality reduction with side information for image classificationHitoshi Nishimura
 
単一物体追跡論文のサーベイ
単一物体追跡論文のサーベイ単一物体追跡論文のサーベイ
単一物体追跡論文のサーベイHitoshi Nishimura
 

More from Hitoshi Nishimura (9)

Tracking emerges by colorizing videosの紹介
Tracking emerges by colorizing videosの紹介Tracking emerges by colorizing videosの紹介
Tracking emerges by colorizing videosの紹介
 
Online real time multiple spatiotemporal action localisation and predictionの紹介
Online real time multiple spatiotemporal action localisation and predictionの紹介Online real time multiple spatiotemporal action localisation and predictionの紹介
Online real time multiple spatiotemporal action localisation and predictionの紹介
 
Learning to discover objects in rgb d images using correlation clusteringの紹介
Learning to discover objects in rgb d images using correlation clusteringの紹介Learning to discover objects in rgb d images using correlation clusteringの紹介
Learning to discover objects in rgb d images using correlation clusteringの紹介
 
A bayesian approach to multimodal visual dictionary learningの紹介
A bayesian approach to multimodal visual dictionary learningの紹介A bayesian approach to multimodal visual dictionary learningの紹介
A bayesian approach to multimodal visual dictionary learningの紹介
 
Sparse isotropic hashingの紹介
Sparse isotropic hashingの紹介Sparse isotropic hashingの紹介
Sparse isotropic hashingの紹介
 
Dimensionality reduction with side information for image classification
Dimensionality reduction with side information for image classificationDimensionality reduction with side information for image classification
Dimensionality reduction with side information for image classification
 
Lucas kanade法について
Lucas kanade法についてLucas kanade法について
Lucas kanade法について
 
単一物体追跡論文のサーベイ
単一物体追跡論文のサーベイ単一物体追跡論文のサーベイ
単一物体追跡論文のサーベイ
 
KCFの紹介
KCFの紹介KCFの紹介
KCFの紹介
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 

Recently uploaded (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 

TRiPOD: Modeling Human-Human and Human-Object Interactions for Pose Forecasting

  • 2. 2 本日の紹介論文  2021/4/8にarXivで発表された論文  著者は有名な人が多い  ソースコードは公開されていない  データセットは公開されている • Social Motion Forecasting (SoMoF) Benchmark (http://somof.stanford.edu/) • このデータセットを使ってICCV2021でワークショップが開催されるらしい (1st Workshop, Benchmark and Challenge on Human Trajectory and Pose Dynamics Forecasting in the Wild)
  • 3. 3  Forecasting human movements (pose dynamics and trajectory)  Real-world applications  including robotics  healthcare  detection of perilous behavioral patterns  Extremely challenging in real-world scenes due to the different factors 1. Interactions between people in the scene 2. Objects involved in the scene can provide informative clues 3. Different levels of interactions • (Movements of all the persons in the scene are not always highly correlated with each other nor the humans to objects) • These different levels of interactions can change over time 4. A person might move outside the sensor field-of-view or be a partially/fully occluded by an object 1. Introduction
  • 5. 5  Existing methods  Pose dynamics forecasting methods [14, 40, 41, 58]  Trajectory forecasting [22, 27]  Problem  Neglect some of these challenging factors • Do not effectively model all the informative environmental and social interactions in the scene • Assume that all tracks and/or body joints are always observable in the past and future  Proposed method  Human pose dynamics and trajectory forecasting one step forward toward more practical scenarios in-the-wild by considering all these factors together 1. Introduction
  • 6. 6  Model  Model the input skeleton body joints, the social human-human and human-object interactions with different attention graphs  These two types of information are different by nature -> Applying an iterative message passing  Humans may retain their influences on each other consistently in future -> Preserve their spatio-temporal attentional relationships by modeling them also in future prediction phase  Address the concept of joint invisibility or body disappearance  Accumulative error in sequential models for long-term sequences -> Take a curriculum learning approach to train our model  Dataset  No proper benchmark dataset for such real-world problem -> Introduce a new benchmark by repurposing existing datasets and introducing relevant evaluation metrics 1. Introduction
  • 7. 7  Pose dynamics forecasting  Human trajectory predictions  Pose dynamics and trajectory forecasting 2. Related Work
  • 8. 8 Our goal: To model the complex human-human and human-object interactions in a way that can also predict all the joint visibility  Problem Definition 3. Trajectory and Pose Dynamics Forecasting A binary value being 0 if the joint is invisible
  • 10. 10 A. Attentional Human Pose History Leverage natural connectivities Influence of joints on each other is not uniform Encode the past skeleton history for each person
  • 11. 11 B. Object and Global Scene Features • visual feature • geometrical information • class label Object detector Spatio-temporal model to represent the sequence
  • 12. 12 C. Human to Object Attention Module Graph attention -> Learn different levels of interactions
  • 13. 13 D. Social Attention Module Graph attention -> Learn different levels of interactions
  • 14. 14 E. Message Passing 𝑓𝑛 2 𝑚𝑛+1 𝑝 𝑓𝑛 1 𝑓𝑛 𝑝 Message to person 𝑝 at step n+1 𝑒𝑛 2 𝑒𝑛 1 Node feature of person 𝑝 Same as above Average
  • 15. 15 F. Future Social Interactions Message Passing result 𝑓𝑁 𝑝 Dynamically reconsider social interactions in the future
  • 16. 16  Problem in the training phase  The model cannot recover from its accumulating errors at each time step -> Feeding this error as the input to the next step propagates it throughout the network  Solution 1. Make the final prediction to consider both the input and output of the RNN decoder at each time step 2. Employ the concept of curriculum • Starting with easier sub-tasks and gradually increasing the difficulty level of the tasks • Divide our future pose prediction problem for 𝜏𝑓 frames into 𝜏𝑓 𝜔 frames G. Training Strategies
  • 17. 17 5. Experiments ABC: Interpreting the interactions between humans and objects D: Being aware to estimate occlusion EF: Handling agent leaving the scene
  • 18. 18  Feature work  Incorporating 3D information (when camera parameters are available)  Considering multi-modal future predictions 6. Conclusion