SlideShare a Scribd company logo
1 of 27
direct use of particle filters
for decision making
Ryuichi Ueda, Chiba inst. of Technology
パーティクルフィルタを推定だけでなく行動決定に直接使う試み
千葉工業大学 上田隆一
日本知能情報ファジィ学会(SOFT)ベンチャー研究会
第1回「動きの様相から先を読む」研究会
@名古屋工業大学
metacognition -Flavell 1979
• knowledge or cognition about cognitive phenomena
– evaluation of the extent of its own knowledge
• how to implement to robots
– probability (Bayes') theory
– implementation
• methods of probabilistic robotics, methods of machine learning,
artificial neural networks, ...
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 2
probability expression of knowledge
• state variables: 𝒙 = 𝑥1, 𝑥2, … 𝑥 𝑛
– 𝑛 = 3 : mobile robot self-localization
– 𝑛 = 10 𝑁: SLAM (mapping)
– The actual 𝒙 is unknown.
• 𝑏𝑒𝑙 𝒙 : the belief of the robot about 𝒙
– a probability density function
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 3
𝑥
𝑦
heavy-tailed distribution
(unconfident)
𝑥
𝑦
peaky distribution
(confident)
𝑥
𝑦
peaky but some peaks
particle filters
• a popular method for self-localization
– Monte Carlo localization: particle filter for self-localization
• used for all of the methods in this presentation
• representation of the belief
• updates of the particles
– Sensor information reduces the distribution of the particles.
– Robot motion invokes motion of the particles.
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 4
by courtesy of Ryoma Aoki (Ueda lab.)
particles
(candidates
of the pose)
the actual pose
(unknown)
an example –Tsukuba challenge
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 5
decide its action
based on the most
reliable particle
• 2km run of autonomous robots
– a standard method for win
• put a LIDAR on their robot
• make a map with a SLAM method beforehand
• probabilistic self-localization with the map
• motion planning with non-probabilistic methods
Hayashibara laboratory's team of Chiba Inst. of Technology in 2017 (completes 2km run.)
severer cases
• RoboCup
– small camera
– vibration and collision
– few landmarks
• a micromouse in the maze
– only four range sensors
– perceptual aliasing
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 6
 Robots must decides their motion based on uncertain 𝑏𝑒𝑙s.
decision with broad beliefs
• Is it possible?
– easy for human beings
• "You sense that you do not yet know a certain chapter in your text
well enough to pass tomorrow's exam, so you read it through once
more." [Flavell 1979]
• Intelligent robots in the real world must be able to ...
– find an action that is effective even if the belief is broad
– find an action to reduce the uncertainty
• 2 (+ 1) cases are presented from our study.
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 7
CASE 1: REAL-TIME QMDP
R. Ueda, T. Arai, K. Sakamoto, Y. Jitsukawa, K. Umeda, H. Osumi, T. Kikuchi
and M. Komura: "Real-time decision making with state-value function
under uncertainty of state estimation – Evaluation with Local Maxima and
Discontinuity," IEEE ICRA, 2005.
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 8
a navigation problem
with multiple destinations
• the problem:
– There are more than one destinations.
– The robot knows its uncertainty of self-localization (𝑏𝑒𝑙).
– The robot must decide an effective action.
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 9
destination 1
destination 2
Which is easy to go?
a goalie task
for RoboCup 4 legged robot league
• three kinds of "destinations (sub-task)"
a) staying in the goal (ball: invisible)
b) punching the ball (ball: near to the goal)
c) closing a goalpost (ball: at a side of the goal)
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 10
ERS-210
x
y
(r,j)
(x,y,q)
goalie
goal
(a)
(b)
(c)
difference of accuracy requirement
• requirement to reach the sub-tasks
– (a) accurate self-localization only relative to the goal
– (b) no need of accurate self-localization
– (c) accurate self-localization
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 11
goal
(a)
(b)
(c)
𝑏𝑒𝑙 is broad but
the relative pose toward
the goal is accurate.
goal
 The robot must choose its sub-task with the
consideration of accuracy requirement in real-time.
real-time QMDP
• QMDP value method: written in [Littman 95]
• composed of offline and online calculation
– offline
• calculate the value (cost to go) function
without consideration of uncertain
• state variables: 𝒙 = 𝑥, 𝑦, 𝜃, 𝑥ball, 𝑦ball
– online
1. place all particles on the value function
2. choose an action that maximizes
the average value of the particles
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 12
state space (5D)
b c
value
a
ball
goal
calculated value function
• 3,000,000 discrete states in 5D state space
• 49[min] calculation with a 3.6[GHz] CPU
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 13
a part of the value function
(values on 𝑥𝑦-plane with a fixed (𝜃, 𝑥ball, 𝑦ball) )
motion with real-time QMDP
• Motion correspond to
the sub-tasks can be seen.
– real-time calculation of1000 particles
with 192MHz CPU
– Detailed evaluation can be seen
in [Ueda ICRA2005]
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 14
x2
waiting in the goal closing a goal post punching the ball
https://youtu.be/fsQicKXE5AU
CASE 2:
PROBABILISTIC FLOW CONTROL
Ryuichi Ueda: Generation of Compensation Behavior of Autonomous
Robot for Uncertainty of Information with Probabilistic Flow Control,
Advanced Robotics, 29(11), pp. 721-734, June, 2015.
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 15
motivation
• When I go to bed at midnight without light,
how I behave?
– I search a wall by my hand, and trace the wall.
• symbolical study: "coastal navigation" [Roy 99]
– planning with uncertainty evaluation at offline calculation
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 16
A degree of freedom is erased.
goal
possibility of lost
(x,y,q)
wall
H
how to realize the behavior
with real-time QMDP
• problems:
– no strategy for obtaining information
• no consideration of uncertainty at offline
• no consideration of future observation at online
– deadlocks
• The robot stops when any motion cannot improve the
average value.
• not fatal in RoboCup but fatal in navigation
 A small modification gives an interesting behavior.
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 17
probabilistic flow control (PFC)
• an additional assumption
– The robot can know whether it reached on a goal or not
through a sensor.
• modification of calculation
– The average value is
weighted by the value.
– Particles near a goal
have a priority.
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 18
value
state space
high weighted
low weighted
a navigation problem with one landmark
• state variables: 𝒙 = 𝑥, 𝑦, 𝜃
• information:
– landmark observation
– goal or not
• Particles do not converge
most of time.
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 19
destination
landmark
Poses of these particles never
contradict the sensor data.
application of PFC
• The robot moves as it is
dragged by some particles
near the goal.
• real-time QMDP
– 73 deadlocks in 100 trials
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 20
other applications
• Some unpublished modifications are applied to.
– (I must write but ...)
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 21
searching behavior of
a manipulator with (modified) PFC
a rod
(position
unknown)
red color:
likelihood of the
rod existence
wandering behavior of a raspberry
pi mouse with (modified) PFC
goal
these movies: https://blog.ueda.tech/?page_id=10034
CASE 3: PARTICLE FILTER ON EPISODE
• Ryuichi Ueda, Kotaro Mizuta, Hiroshi Yamakawa and Hiroyuki Okada:
Particle Filter on Episode for Learning Decision Making Rule, Proc. of
The 14th International Conference on Intelligent Autonomous
Systems (IAS-14), Shanghai, July, 2016.
• The 35-th Annual Conference of the RSJ (to appear)
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 22
motivation
• decision making before/without environmental maps
• Memory goes before, and a map follow after.
– Hippocampus of mammals generate a sequence of memory,
and the sequence becomes maps with dropout of time sequence
– information: memory > map
• Robots can store seemingly unlimited memory.
– different from creatures
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 23
no need of SLAM for intelligent decision making (?)
particle filter on episode (PFoE)
• procedure
1. record I/O and reward
2. calculate the similarity between the current situation and
each past state
3. choose an action that maximizes future reward
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 24
time axis
states (sensors)
episode
rewards
belief
s s s s s s s
present time
1 -1
a a a a a a a actions
past current
particles
a simple application
• The robot goes from the bottom of the T shape maze
to the goal that is set alternately on one of the arm.
• conditions (very simplified)
– rewards:
• 1: the robot turned to the goal arm
• -1: it turned to the wrong arm
– only four states:
start, T-junction, after turn,
end of an arm
– only one chance of decision:
right or left at T-junction
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 25
another version of PFoE
• used for teaching
• presented in the 35-th
Annual Conference of the RSJ
– currently secret
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 26
these movies on the web :
https://blog.ueda.tech/?page_id=10021
conclusion of this presentation
• Real-time QMDP can choose appropriate locations of a
goalkeeper in accordance with the belief of the robot.
– on a 192MHz CPU, 32MB DRAM
– It was actually used in RoboCup competitions for some years.
• PFC compensates loss of information by motion of robots.
– Robots with PFC show "searching behavior."
• We are trying to build a cognitive/metacognitive model
for robots with poor computing resources.
Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 27

More Related Content

Similar to direct use of particle filters for decision making

Self-Flying Drones: On a Mission to Navigate Dark, Dangerous and Unknown Worlds
Self-Flying Drones: On a Mission to Navigate Dark, Dangerous and Unknown WorldsSelf-Flying Drones: On a Mission to Navigate Dark, Dangerous and Unknown Worlds
Self-Flying Drones: On a Mission to Navigate Dark, Dangerous and Unknown WorldsTahoe Silicon Mountain
 
Pso kota baru parahyangan 2017
Pso kota baru parahyangan 2017Pso kota baru parahyangan 2017
Pso kota baru parahyangan 2017Iwan Sofana
 
게임 AI를 통해 본 인공지능 기본 개념
게임 AI를 통해 본 인공지능 기본 개념게임 AI를 통해 본 인공지능 기본 개념
게임 AI를 통해 본 인공지능 기본 개념Keunhyun Oh
 
Useing PSO to optimize logit model with Tensorflow
Useing PSO to optimize logit model with TensorflowUseing PSO to optimize logit model with Tensorflow
Useing PSO to optimize logit model with TensorflowYi-Fan Liou
 
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...Akira Taniguchi
 
Particle Swarm Optimization - PSO
Particle Swarm Optimization - PSOParticle Swarm Optimization - PSO
Particle Swarm Optimization - PSOMohamed Talaat
 
Hyperoptimized Machine Learning and Deep Learning Methods For Geospatial and ...
Hyperoptimized Machine Learning and Deep Learning Methods For Geospatial and ...Hyperoptimized Machine Learning and Deep Learning Methods For Geospatial and ...
Hyperoptimized Machine Learning and Deep Learning Methods For Geospatial and ...Neelabha Pant
 
Citron : Context Information Acquisition Framework on Personal Devices
Citron : Context Information Acquisition Framework on Personal DevicesCitron : Context Information Acquisition Framework on Personal Devices
Citron : Context Information Acquisition Framework on Personal DevicesTetsuo Yamabe
 
NASA.ppt
NASA.pptNASA.ppt
NASA.pptVyshuy
 
Particle Swarm Optimization by Rajorshi Mukherjee
Particle Swarm Optimization by Rajorshi MukherjeeParticle Swarm Optimization by Rajorshi Mukherjee
Particle Swarm Optimization by Rajorshi MukherjeeRajorshi Mukherjee
 
Simulation-based optimization: Upper Confidence Tree and Direct Policy Search
Simulation-based optimization: Upper Confidence Tree and Direct Policy SearchSimulation-based optimization: Upper Confidence Tree and Direct Policy Search
Simulation-based optimization: Upper Confidence Tree and Direct Policy SearchOlivier Teytaud
 
Effect of Self-animated Avatars in Virtual Environments
Effect of Self-animated Avatars in Virtual EnvironmentsEffect of Self-animated Avatars in Virtual Environments
Effect of Self-animated Avatars in Virtual Environmentsmukundraj2
 
Human action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorHuman action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorSoma Boubou
 
Y conf talk - Andrej Karpathy
Y conf talk - Andrej KarpathyY conf talk - Andrej Karpathy
Y conf talk - Andrej KarpathySze Siong Teo
 
Intro to data visualization
Intro to data visualizationIntro to data visualization
Intro to data visualizationJan Aerts
 
Lecture 4 neural networks
Lecture 4 neural networksLecture 4 neural networks
Lecture 4 neural networksParveenMalik18
 
Types of Artificial Intelligence.ppt
Types of Artificial Intelligence.pptTypes of Artificial Intelligence.ppt
Types of Artificial Intelligence.pptGEETHAS668001
 

Similar to direct use of particle filters for decision making (20)

Self-Flying Drones: On a Mission to Navigate Dark, Dangerous and Unknown Worlds
Self-Flying Drones: On a Mission to Navigate Dark, Dangerous and Unknown WorldsSelf-Flying Drones: On a Mission to Navigate Dark, Dangerous and Unknown Worlds
Self-Flying Drones: On a Mission to Navigate Dark, Dangerous and Unknown Worlds
 
Pso kota baru parahyangan 2017
Pso kota baru parahyangan 2017Pso kota baru parahyangan 2017
Pso kota baru parahyangan 2017
 
게임 AI를 통해 본 인공지능 기본 개념
게임 AI를 통해 본 인공지능 기본 개념게임 AI를 통해 본 인공지능 기본 개념
게임 AI를 통해 본 인공지능 기본 개념
 
Useing PSO to optimize logit model with Tensorflow
Useing PSO to optimize logit model with TensorflowUseing PSO to optimize logit model with Tensorflow
Useing PSO to optimize logit model with Tensorflow
 
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
Simultaneous Localization, Mapping and Self-body Shape Estimation by a Mobile...
 
Particle Swarm Optimization - PSO
Particle Swarm Optimization - PSOParticle Swarm Optimization - PSO
Particle Swarm Optimization - PSO
 
AI Robotics
AI RoboticsAI Robotics
AI Robotics
 
Hyperoptimized Machine Learning and Deep Learning Methods For Geospatial and ...
Hyperoptimized Machine Learning and Deep Learning Methods For Geospatial and ...Hyperoptimized Machine Learning and Deep Learning Methods For Geospatial and ...
Hyperoptimized Machine Learning and Deep Learning Methods For Geospatial and ...
 
alexVAE_New.pdf
alexVAE_New.pdfalexVAE_New.pdf
alexVAE_New.pdf
 
Neural Networks-1
Neural Networks-1Neural Networks-1
Neural Networks-1
 
Citron : Context Information Acquisition Framework on Personal Devices
Citron : Context Information Acquisition Framework on Personal DevicesCitron : Context Information Acquisition Framework on Personal Devices
Citron : Context Information Acquisition Framework on Personal Devices
 
NASA.ppt
NASA.pptNASA.ppt
NASA.ppt
 
Particle Swarm Optimization by Rajorshi Mukherjee
Particle Swarm Optimization by Rajorshi MukherjeeParticle Swarm Optimization by Rajorshi Mukherjee
Particle Swarm Optimization by Rajorshi Mukherjee
 
Simulation-based optimization: Upper Confidence Tree and Direct Policy Search
Simulation-based optimization: Upper Confidence Tree and Direct Policy SearchSimulation-based optimization: Upper Confidence Tree and Direct Policy Search
Simulation-based optimization: Upper Confidence Tree and Direct Policy Search
 
Effect of Self-animated Avatars in Virtual Environments
Effect of Self-animated Avatars in Virtual EnvironmentsEffect of Self-animated Avatars in Virtual Environments
Effect of Self-animated Avatars in Virtual Environments
 
Human action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptorHuman action recognition with kinect using a joint motion descriptor
Human action recognition with kinect using a joint motion descriptor
 
Y conf talk - Andrej Karpathy
Y conf talk - Andrej KarpathyY conf talk - Andrej Karpathy
Y conf talk - Andrej Karpathy
 
Intro to data visualization
Intro to data visualizationIntro to data visualization
Intro to data visualization
 
Lecture 4 neural networks
Lecture 4 neural networksLecture 4 neural networks
Lecture 4 neural networks
 
Types of Artificial Intelligence.ppt
Types of Artificial Intelligence.pptTypes of Artificial Intelligence.ppt
Types of Artificial Intelligence.ppt
 

More from Ryuichi Ueda

第27回ロボティクスシンポジアスライド
第27回ロボティクスシンポジアスライド第27回ロボティクスシンポジアスライド
第27回ロボティクスシンポジアスライドRyuichi Ueda
 
シェル・ワンライナー160本ノック
シェル・ワンライナー160本ノックシェル・ワンライナー160本ノック
シェル・ワンライナー160本ノックRyuichi Ueda
 
日本ロボット学会第139回ロボット工学セミナー
日本ロボット学会第139回ロボット工学セミナー日本ロボット学会第139回ロボット工学セミナー
日本ロボット学会第139回ロボット工学セミナーRyuichi Ueda
 
シェル芸勉強会と会場の話
シェル芸勉強会と会場の話シェル芸勉強会と会場の話
シェル芸勉強会と会場の話Ryuichi Ueda
 
PythonとJupyter Notebookを利用した教科書「詳解確率ロボティクス」の企画と執筆
PythonとJupyter Notebookを利用した教科書「詳解確率ロボティクス」の企画と執筆PythonとJupyter Notebookを利用した教科書「詳解確率ロボティクス」の企画と執筆
PythonとJupyter Notebookを利用した教科書「詳解確率ロボティクス」の企画と執筆Ryuichi Ueda
 
第45回シェル芸勉強会オープニングスライド
第45回シェル芸勉強会オープニングスライド第45回シェル芸勉強会オープニングスライド
第45回シェル芸勉強会オープニングスライドRyuichi Ueda
 
bash(の変な使い方)update
bash(の変な使い方)updatebash(の変な使い方)update
bash(の変な使い方)updateRyuichi Ueda
 
第41回シェル芸勉強会 午後オープニング
第41回シェル芸勉強会 午後オープニング第41回シェル芸勉強会 午後オープニング
第41回シェル芸勉強会 午後オープニングRyuichi Ueda
 
Searching Behavior of a Simple Manipulator only with Sense of Touch Generated...
Searching Behavior of a Simple Manipulator only with Sense of Touch Generated...Searching Behavior of a Simple Manipulator only with Sense of Touch Generated...
Searching Behavior of a Simple Manipulator only with Sense of Touch Generated...Ryuichi Ueda
 
20181113_子ども夢ロボット&トーク
20181113_子ども夢ロボット&トーク20181113_子ども夢ロボット&トーク
20181113_子ども夢ロボット&トークRyuichi Ueda
 
第37回シェル芸勉強会イントロ
第37回シェル芸勉強会イントロ第37回シェル芸勉強会イントロ
第37回シェル芸勉強会イントロRyuichi Ueda
 
シェル芸勉強会にみる、コミュニティを通じたIT学習
シェル芸勉強会にみる、コミュニティを通じたIT学習シェル芸勉強会にみる、コミュニティを通じたIT学習
シェル芸勉強会にみる、コミュニティを通じたIT学習Ryuichi Ueda
 
ROSチュートリアル ROBOMECH2018
ROSチュートリアル ROBOMECH2018ROSチュートリアル ROBOMECH2018
ROSチュートリアル ROBOMECH2018Ryuichi Ueda
 
poster of PFoE used in ICRA 2018
poster of PFoE used in ICRA 2018poster of PFoE used in ICRA 2018
poster of PFoE used in ICRA 2018Ryuichi Ueda
 
Robot frontier lesson3 2018
Robot frontier lesson3 2018Robot frontier lesson3 2018
Robot frontier lesson3 2018Ryuichi Ueda
 
Robot frontier lesson2 2018
Robot frontier lesson2 2018Robot frontier lesson2 2018
Robot frontier lesson2 2018Ryuichi Ueda
 
Robot frontier lesson1 2018
Robot frontier lesson1 2018Robot frontier lesson1 2018
Robot frontier lesson1 2018Ryuichi Ueda
 
第34回シェル芸勉強会
第34回シェル芸勉強会第34回シェル芸勉強会
第34回シェル芸勉強会Ryuichi Ueda
 
第32回信号処理シンポジウム「Raspberry PiとROSを 使ったロボットシステム」
第32回信号処理シンポジウム「Raspberry PiとROSを使ったロボットシステム」第32回信号処理シンポジウム「Raspberry PiとROSを使ったロボットシステム」
第32回信号処理シンポジウム「Raspberry PiとROSを 使ったロボットシステム」Ryuichi Ueda
 
2017年10月18日 シェル芸勉強会 meets バイオインフォマティクス vol.1 スライド
2017年10月18日 シェル芸勉強会 meets バイオインフォマティクス vol.1 スライド2017年10月18日 シェル芸勉強会 meets バイオインフォマティクス vol.1 スライド
2017年10月18日 シェル芸勉強会 meets バイオインフォマティクス vol.1 スライドRyuichi Ueda
 

More from Ryuichi Ueda (20)

第27回ロボティクスシンポジアスライド
第27回ロボティクスシンポジアスライド第27回ロボティクスシンポジアスライド
第27回ロボティクスシンポジアスライド
 
シェル・ワンライナー160本ノック
シェル・ワンライナー160本ノックシェル・ワンライナー160本ノック
シェル・ワンライナー160本ノック
 
日本ロボット学会第139回ロボット工学セミナー
日本ロボット学会第139回ロボット工学セミナー日本ロボット学会第139回ロボット工学セミナー
日本ロボット学会第139回ロボット工学セミナー
 
シェル芸勉強会と会場の話
シェル芸勉強会と会場の話シェル芸勉強会と会場の話
シェル芸勉強会と会場の話
 
PythonとJupyter Notebookを利用した教科書「詳解確率ロボティクス」の企画と執筆
PythonとJupyter Notebookを利用した教科書「詳解確率ロボティクス」の企画と執筆PythonとJupyter Notebookを利用した教科書「詳解確率ロボティクス」の企画と執筆
PythonとJupyter Notebookを利用した教科書「詳解確率ロボティクス」の企画と執筆
 
第45回シェル芸勉強会オープニングスライド
第45回シェル芸勉強会オープニングスライド第45回シェル芸勉強会オープニングスライド
第45回シェル芸勉強会オープニングスライド
 
bash(の変な使い方)update
bash(の変な使い方)updatebash(の変な使い方)update
bash(の変な使い方)update
 
第41回シェル芸勉強会 午後オープニング
第41回シェル芸勉強会 午後オープニング第41回シェル芸勉強会 午後オープニング
第41回シェル芸勉強会 午後オープニング
 
Searching Behavior of a Simple Manipulator only with Sense of Touch Generated...
Searching Behavior of a Simple Manipulator only with Sense of Touch Generated...Searching Behavior of a Simple Manipulator only with Sense of Touch Generated...
Searching Behavior of a Simple Manipulator only with Sense of Touch Generated...
 
20181113_子ども夢ロボット&トーク
20181113_子ども夢ロボット&トーク20181113_子ども夢ロボット&トーク
20181113_子ども夢ロボット&トーク
 
第37回シェル芸勉強会イントロ
第37回シェル芸勉強会イントロ第37回シェル芸勉強会イントロ
第37回シェル芸勉強会イントロ
 
シェル芸勉強会にみる、コミュニティを通じたIT学習
シェル芸勉強会にみる、コミュニティを通じたIT学習シェル芸勉強会にみる、コミュニティを通じたIT学習
シェル芸勉強会にみる、コミュニティを通じたIT学習
 
ROSチュートリアル ROBOMECH2018
ROSチュートリアル ROBOMECH2018ROSチュートリアル ROBOMECH2018
ROSチュートリアル ROBOMECH2018
 
poster of PFoE used in ICRA 2018
poster of PFoE used in ICRA 2018poster of PFoE used in ICRA 2018
poster of PFoE used in ICRA 2018
 
Robot frontier lesson3 2018
Robot frontier lesson3 2018Robot frontier lesson3 2018
Robot frontier lesson3 2018
 
Robot frontier lesson2 2018
Robot frontier lesson2 2018Robot frontier lesson2 2018
Robot frontier lesson2 2018
 
Robot frontier lesson1 2018
Robot frontier lesson1 2018Robot frontier lesson1 2018
Robot frontier lesson1 2018
 
第34回シェル芸勉強会
第34回シェル芸勉強会第34回シェル芸勉強会
第34回シェル芸勉強会
 
第32回信号処理シンポジウム「Raspberry PiとROSを 使ったロボットシステム」
第32回信号処理シンポジウム「Raspberry PiとROSを使ったロボットシステム」第32回信号処理シンポジウム「Raspberry PiとROSを使ったロボットシステム」
第32回信号処理シンポジウム「Raspberry PiとROSを 使ったロボットシステム」
 
2017年10月18日 シェル芸勉強会 meets バイオインフォマティクス vol.1 スライド
2017年10月18日 シェル芸勉強会 meets バイオインフォマティクス vol.1 スライド2017年10月18日 シェル芸勉強会 meets バイオインフォマティクス vol.1 スライド
2017年10月18日 シェル芸勉強会 meets バイオインフォマティクス vol.1 スライド
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 

Recently uploaded (20)

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 

direct use of particle filters for decision making

  • 1. direct use of particle filters for decision making Ryuichi Ueda, Chiba inst. of Technology パーティクルフィルタを推定だけでなく行動決定に直接使う試み 千葉工業大学 上田隆一 日本知能情報ファジィ学会(SOFT)ベンチャー研究会 第1回「動きの様相から先を読む」研究会 @名古屋工業大学
  • 2. metacognition -Flavell 1979 • knowledge or cognition about cognitive phenomena – evaluation of the extent of its own knowledge • how to implement to robots – probability (Bayes') theory – implementation • methods of probabilistic robotics, methods of machine learning, artificial neural networks, ... Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 2
  • 3. probability expression of knowledge • state variables: 𝒙 = 𝑥1, 𝑥2, … 𝑥 𝑛 – 𝑛 = 3 : mobile robot self-localization – 𝑛 = 10 𝑁: SLAM (mapping) – The actual 𝒙 is unknown. • 𝑏𝑒𝑙 𝒙 : the belief of the robot about 𝒙 – a probability density function Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 3 𝑥 𝑦 heavy-tailed distribution (unconfident) 𝑥 𝑦 peaky distribution (confident) 𝑥 𝑦 peaky but some peaks
  • 4. particle filters • a popular method for self-localization – Monte Carlo localization: particle filter for self-localization • used for all of the methods in this presentation • representation of the belief • updates of the particles – Sensor information reduces the distribution of the particles. – Robot motion invokes motion of the particles. Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 4 by courtesy of Ryoma Aoki (Ueda lab.) particles (candidates of the pose) the actual pose (unknown)
  • 5. an example –Tsukuba challenge Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 5 decide its action based on the most reliable particle • 2km run of autonomous robots – a standard method for win • put a LIDAR on their robot • make a map with a SLAM method beforehand • probabilistic self-localization with the map • motion planning with non-probabilistic methods Hayashibara laboratory's team of Chiba Inst. of Technology in 2017 (completes 2km run.)
  • 6. severer cases • RoboCup – small camera – vibration and collision – few landmarks • a micromouse in the maze – only four range sensors – perceptual aliasing Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 6  Robots must decides their motion based on uncertain 𝑏𝑒𝑙s.
  • 7. decision with broad beliefs • Is it possible? – easy for human beings • "You sense that you do not yet know a certain chapter in your text well enough to pass tomorrow's exam, so you read it through once more." [Flavell 1979] • Intelligent robots in the real world must be able to ... – find an action that is effective even if the belief is broad – find an action to reduce the uncertainty • 2 (+ 1) cases are presented from our study. Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 7
  • 8. CASE 1: REAL-TIME QMDP R. Ueda, T. Arai, K. Sakamoto, Y. Jitsukawa, K. Umeda, H. Osumi, T. Kikuchi and M. Komura: "Real-time decision making with state-value function under uncertainty of state estimation – Evaluation with Local Maxima and Discontinuity," IEEE ICRA, 2005. Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 8
  • 9. a navigation problem with multiple destinations • the problem: – There are more than one destinations. – The robot knows its uncertainty of self-localization (𝑏𝑒𝑙). – The robot must decide an effective action. Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 9 destination 1 destination 2 Which is easy to go?
  • 10. a goalie task for RoboCup 4 legged robot league • three kinds of "destinations (sub-task)" a) staying in the goal (ball: invisible) b) punching the ball (ball: near to the goal) c) closing a goalpost (ball: at a side of the goal) Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 10 ERS-210 x y (r,j) (x,y,q) goalie goal (a) (b) (c)
  • 11. difference of accuracy requirement • requirement to reach the sub-tasks – (a) accurate self-localization only relative to the goal – (b) no need of accurate self-localization – (c) accurate self-localization Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 11 goal (a) (b) (c) 𝑏𝑒𝑙 is broad but the relative pose toward the goal is accurate. goal  The robot must choose its sub-task with the consideration of accuracy requirement in real-time.
  • 12. real-time QMDP • QMDP value method: written in [Littman 95] • composed of offline and online calculation – offline • calculate the value (cost to go) function without consideration of uncertain • state variables: 𝒙 = 𝑥, 𝑦, 𝜃, 𝑥ball, 𝑦ball – online 1. place all particles on the value function 2. choose an action that maximizes the average value of the particles Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 12 state space (5D) b c value a
  • 13. ball goal calculated value function • 3,000,000 discrete states in 5D state space • 49[min] calculation with a 3.6[GHz] CPU Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 13 a part of the value function (values on 𝑥𝑦-plane with a fixed (𝜃, 𝑥ball, 𝑦ball) )
  • 14. motion with real-time QMDP • Motion correspond to the sub-tasks can be seen. – real-time calculation of1000 particles with 192MHz CPU – Detailed evaluation can be seen in [Ueda ICRA2005] Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 14 x2 waiting in the goal closing a goal post punching the ball https://youtu.be/fsQicKXE5AU
  • 15. CASE 2: PROBABILISTIC FLOW CONTROL Ryuichi Ueda: Generation of Compensation Behavior of Autonomous Robot for Uncertainty of Information with Probabilistic Flow Control, Advanced Robotics, 29(11), pp. 721-734, June, 2015. Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 15
  • 16. motivation • When I go to bed at midnight without light, how I behave? – I search a wall by my hand, and trace the wall. • symbolical study: "coastal navigation" [Roy 99] – planning with uncertainty evaluation at offline calculation Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 16 A degree of freedom is erased. goal possibility of lost (x,y,q) wall H
  • 17. how to realize the behavior with real-time QMDP • problems: – no strategy for obtaining information • no consideration of uncertainty at offline • no consideration of future observation at online – deadlocks • The robot stops when any motion cannot improve the average value. • not fatal in RoboCup but fatal in navigation  A small modification gives an interesting behavior. Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 17
  • 18. probabilistic flow control (PFC) • an additional assumption – The robot can know whether it reached on a goal or not through a sensor. • modification of calculation – The average value is weighted by the value. – Particles near a goal have a priority. Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 18 value state space high weighted low weighted
  • 19. a navigation problem with one landmark • state variables: 𝒙 = 𝑥, 𝑦, 𝜃 • information: – landmark observation – goal or not • Particles do not converge most of time. Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 19 destination landmark Poses of these particles never contradict the sensor data.
  • 20. application of PFC • The robot moves as it is dragged by some particles near the goal. • real-time QMDP – 73 deadlocks in 100 trials Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 20
  • 21. other applications • Some unpublished modifications are applied to. – (I must write but ...) Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 21 searching behavior of a manipulator with (modified) PFC a rod (position unknown) red color: likelihood of the rod existence wandering behavior of a raspberry pi mouse with (modified) PFC goal these movies: https://blog.ueda.tech/?page_id=10034
  • 22. CASE 3: PARTICLE FILTER ON EPISODE • Ryuichi Ueda, Kotaro Mizuta, Hiroshi Yamakawa and Hiroyuki Okada: Particle Filter on Episode for Learning Decision Making Rule, Proc. of The 14th International Conference on Intelligent Autonomous Systems (IAS-14), Shanghai, July, 2016. • The 35-th Annual Conference of the RSJ (to appear) Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 22
  • 23. motivation • decision making before/without environmental maps • Memory goes before, and a map follow after. – Hippocampus of mammals generate a sequence of memory, and the sequence becomes maps with dropout of time sequence – information: memory > map • Robots can store seemingly unlimited memory. – different from creatures Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 23 no need of SLAM for intelligent decision making (?)
  • 24. particle filter on episode (PFoE) • procedure 1. record I/O and reward 2. calculate the similarity between the current situation and each past state 3. choose an action that maximizes future reward Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 24 time axis states (sensors) episode rewards belief s s s s s s s present time 1 -1 a a a a a a a actions past current particles
  • 25. a simple application • The robot goes from the bottom of the T shape maze to the goal that is set alternately on one of the arm. • conditions (very simplified) – rewards: • 1: the robot turned to the goal arm • -1: it turned to the wrong arm – only four states: start, T-junction, after turn, end of an arm – only one chance of decision: right or left at T-junction Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 25
  • 26. another version of PFoE • used for teaching • presented in the 35-th Annual Conference of the RSJ – currently secret Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 26 these movies on the web : https://blog.ueda.tech/?page_id=10021
  • 27. conclusion of this presentation • Real-time QMDP can choose appropriate locations of a goalkeeper in accordance with the belief of the robot. – on a 192MHz CPU, 32MB DRAM – It was actually used in RoboCup competitions for some years. • PFC compensates loss of information by motion of robots. – Robots with PFC show "searching behavior." • We are trying to build a cognitive/metacognitive model for robots with poor computing resources. Aug. 1, 2017 第1回「動きの様相から先を読む」研究会 27