SlideShare a Scribd company logo
1 of 25
Download to read offline
Jérôme Gasperi
WGISS #37

Cocoa Beach, Florida - USA - April 16th, 2014
Semantic search
Semantic search helps users to
find the right data
Semantic search helps users to
find the right data
How to add semantics capabilities to EO products search services ?
Characterize products with relevant information.
Think « users », not « experts »
1
Characterize products with relevant information.
Think « users », not « experts »
1
Decode users natural language queries
2
Use footprint to enrich metadata from exogenous data
1 Enrich products
github.com/jjrom/itag
!
Tag this footprint with continent, country and Land use
!
http://goo.gl/WtbcbR

iTag1 Enrich products
2 Decode queries
RESTo provides semantic search capabilities
It uses a Query Analyzer to translate query into a set of EO OpenSearch parameters
Query string analysis algorithm
is based on simple recognition
of words and patterns
Split query string
into list of unitary words
Extract «key=value» strings e.g. orbitNumber=4
Extract Platforms and
Instruments
Platforms and instruments list are stored
within common dictionary
!https://github.com/jjrom/resto/blob/
master/resto/dictionaries/common.php
Remove excluded words
and non dictionary words
with length < 4 characters
e.g. «area of Mexico in 2012»
Extract patterns and
dates
e.g. «acquired in the last 2 days»
Extract keywords e.g. «urban area in France»
Extract location on
remaining words
e.g. «images acquired in Toulouse»
2 Decode queries
Recognized patterns
<with> "keyword"	
<without> "keyword"	
!
"quantity" <lesser> (than) "numeric" "unit"	
"quantity" <greater> (than) "numeric" "unit"	
"quantity" <equal> (to) "numeric" "unit"	
<lesser> (than) "numeric" "unit" (of) "quantity"	
<greater> (than) "numeric" "unit" (of) "quantity"	
<equal> (to) "numeric" "unit" (of) "quantity"	
"quantity" <between> "numeric" <and> "numeric" ("unit")	
<between> "numeric" <and> "numeric" "unit" (of) "quantity"	
!
<today>	
<yesterday>	
<before> "date"	
<after> "date"	
<between> "date" <and> "date"	
"numeric" "(year|day|month)" <ago>	
<last> "(year|day|month)"	
<last> "numeric" "(year|day|month)"	
"numeric" <last> "(year|day|month)"	
"(year|day|month)" <last>	
<since> "numeric" "(year|day|month)"	
<since> "month" "year"	
<since> "date"	
<since> "numeric" <last> "(year|day|month)"	
<since> <last> "numeric" "(year|day|month)"	
<since> <last> "(year|day|month)"	
<since> "(year|day|month)" <last>	
2 Decode queries
$dictionary = array( 	
'excluded' => array(	
'than',	
'image',

...	
),	
'modifiers' => array(	
'ago' => 'ago',	
'before' => 'before',	
'after' => 'after',	
...	
),	
'units' => array(	
'm' => 'm',	
'meter' => 'm',

'days' => 'days',

...	
),	
'numbers' => array(	
'one' => '1',	
...	
),	
'months' => array(	
'january' => '01',	
...	
),	
'quantities' => array(	
'resolution' => 'resolution',	
...	
),	
'keywords' => array(	
'continent' => array(	
'europe' => 'europe',	
...	
)	
)	
Words are stored within a dictionary
2 Decode queries
« Images of urban area in the US acquired in the last 10 days with less than 5 % of cloud cover »
Example
2 Decode queries
2 Decode queries
« Images of urban area in the US acquired in the last 10 days with less than 5 % of cloud cover »
Example
keyword location date acquisition parameter
2. Each search result has an « human readable url » that can
be indexed by web crawler (i.e. google robots)
1. Search parameters are derived from
Natural Language query
3. Keywords on resources are links to search requests :
they can be indexed by web crawler…and so on
Search (example)
2. Each search result has an « human readable url » that can
be indexed by web crawler (i.e. google robots)
1. Search parameters are derived from
Natural Language query
3. Keywords on resources are links to search requests :
they can be indexed by web crawler…and so on
Search (example)
http://goo.gl/GvMEHj
Issues with keywords approach
Earthquakes in november 2008 in china
Earthquakes in november 2008 in china
Ambiguous since it appears to be
a location in New Zealand
« Linked data is the right way to do Semantic Web »
Tim Berners-Lee
Update RESTo JSON model to follow JSON-LD format
{
"@context": "http://json-ld.org/contexts/person.jsonld",
"@id": "http://dbpedia.org/resource/John_Lennon",
"name": "John Lennon",
"born": "1940-10-09",
"spouse": "http://dbpedia.org/resource/Cynthia_Lennon"
}

More Related Content

Viewers also liked

2016 vendor showcase track: an introduction to spike and the application of p...
2016 vendor showcase track: an introduction to spike and the application of p...2016 vendor showcase track: an introduction to spike and the application of p...
2016 vendor showcase track: an introduction to spike and the application of p...GIS in the Rockies
 
2016 conservation track: under the hood of an rea: what is within a rapid ec...
2016 conservation track: under the hood of an rea:  what is within a rapid ec...2016 conservation track: under the hood of an rea:  what is within a rapid ec...
2016 conservation track: under the hood of an rea: what is within a rapid ec...GIS in the Rockies
 
2016 asprs track: gis support for trail planning by jeff orlowski
2016 asprs track: gis support for trail planning by jeff orlowski2016 asprs track: gis support for trail planning by jeff orlowski
2016 asprs track: gis support for trail planning by jeff orlowskiGIS in the Rockies
 
2016 conservation track: evaluating lidar derived synthetic streams as a ...
2016 conservation track: evaluating   lidar derived   synthetic streams as a ...2016 conservation track: evaluating   lidar derived   synthetic streams as a ...
2016 conservation track: evaluating lidar derived synthetic streams as a ...GIS in the Rockies
 
2016 conservation track: broad scale assessment, planning and management of ...
2016 conservation track:  broad scale assessment, planning and management of ...2016 conservation track:  broad scale assessment, planning and management of ...
2016 conservation track: broad scale assessment, planning and management of ...GIS in the Rockies
 
2016 foss4 g track facilitators and inhibitors for the integration and use ...
2016 foss4 g track  facilitators and inhibitors  for the integration and use ...2016 foss4 g track  facilitators and inhibitors  for the integration and use ...
2016 foss4 g track facilitators and inhibitors for the integration and use ...GIS in the Rockies
 
2016 conservation track: geolocation by light: following the migration of le...
2016 conservation track:  geolocation by light: following the migration of le...2016 conservation track:  geolocation by light: following the migration of le...
2016 conservation track: geolocation by light: following the migration of le...GIS in the Rockies
 
2016 conservation track: strategies and tips for large scale data collection ...
2016 conservation track: strategies and tips for large scale data collection ...2016 conservation track: strategies and tips for large scale data collection ...
2016 conservation track: strategies and tips for large scale data collection ...GIS in the Rockies
 
2016 conservation track: automated river classification using gis delineated ...
2016 conservation track: automated river classification using gis delineated ...2016 conservation track: automated river classification using gis delineated ...
2016 conservation track: automated river classification using gis delineated ...GIS in the Rockies
 
2016 conservation track: identifying key wetlands areas in the rio grande na...
2016 conservation track:  identifying key wetlands areas in the rio grande na...2016 conservation track:  identifying key wetlands areas in the rio grande na...
2016 conservation track: identifying key wetlands areas in the rio grande na...GIS in the Rockies
 
2016 conservation track: ecological and social resilience to climate variabil...
2016 conservation track: ecological and social resilience to climate variabil...2016 conservation track: ecological and social resilience to climate variabil...
2016 conservation track: ecological and social resilience to climate variabil...GIS in the Rockies
 
2016 conservation track: applications of rapid ecoregional assessments (re as...
2016 conservation track: applications of rapid ecoregional assessments (re as...2016 conservation track: applications of rapid ecoregional assessments (re as...
2016 conservation track: applications of rapid ecoregional assessments (re as...GIS in the Rockies
 
2016 asprs track: overview and user perspective of usgs 3 dep lidar by john ...
2016 asprs track:  overview and user perspective of usgs 3 dep lidar by john ...2016 asprs track:  overview and user perspective of usgs 3 dep lidar by john ...
2016 asprs track: overview and user perspective of usgs 3 dep lidar by john ...GIS in the Rockies
 

Viewers also liked (13)

2016 vendor showcase track: an introduction to spike and the application of p...
2016 vendor showcase track: an introduction to spike and the application of p...2016 vendor showcase track: an introduction to spike and the application of p...
2016 vendor showcase track: an introduction to spike and the application of p...
 
2016 conservation track: under the hood of an rea: what is within a rapid ec...
2016 conservation track: under the hood of an rea:  what is within a rapid ec...2016 conservation track: under the hood of an rea:  what is within a rapid ec...
2016 conservation track: under the hood of an rea: what is within a rapid ec...
 
2016 asprs track: gis support for trail planning by jeff orlowski
2016 asprs track: gis support for trail planning by jeff orlowski2016 asprs track: gis support for trail planning by jeff orlowski
2016 asprs track: gis support for trail planning by jeff orlowski
 
2016 conservation track: evaluating lidar derived synthetic streams as a ...
2016 conservation track: evaluating   lidar derived   synthetic streams as a ...2016 conservation track: evaluating   lidar derived   synthetic streams as a ...
2016 conservation track: evaluating lidar derived synthetic streams as a ...
 
2016 conservation track: broad scale assessment, planning and management of ...
2016 conservation track:  broad scale assessment, planning and management of ...2016 conservation track:  broad scale assessment, planning and management of ...
2016 conservation track: broad scale assessment, planning and management of ...
 
2016 foss4 g track facilitators and inhibitors for the integration and use ...
2016 foss4 g track  facilitators and inhibitors  for the integration and use ...2016 foss4 g track  facilitators and inhibitors  for the integration and use ...
2016 foss4 g track facilitators and inhibitors for the integration and use ...
 
2016 conservation track: geolocation by light: following the migration of le...
2016 conservation track:  geolocation by light: following the migration of le...2016 conservation track:  geolocation by light: following the migration of le...
2016 conservation track: geolocation by light: following the migration of le...
 
2016 conservation track: strategies and tips for large scale data collection ...
2016 conservation track: strategies and tips for large scale data collection ...2016 conservation track: strategies and tips for large scale data collection ...
2016 conservation track: strategies and tips for large scale data collection ...
 
2016 conservation track: automated river classification using gis delineated ...
2016 conservation track: automated river classification using gis delineated ...2016 conservation track: automated river classification using gis delineated ...
2016 conservation track: automated river classification using gis delineated ...
 
2016 conservation track: identifying key wetlands areas in the rio grande na...
2016 conservation track:  identifying key wetlands areas in the rio grande na...2016 conservation track:  identifying key wetlands areas in the rio grande na...
2016 conservation track: identifying key wetlands areas in the rio grande na...
 
2016 conservation track: ecological and social resilience to climate variabil...
2016 conservation track: ecological and social resilience to climate variabil...2016 conservation track: ecological and social resilience to climate variabil...
2016 conservation track: ecological and social resilience to climate variabil...
 
2016 conservation track: applications of rapid ecoregional assessments (re as...
2016 conservation track: applications of rapid ecoregional assessments (re as...2016 conservation track: applications of rapid ecoregional assessments (re as...
2016 conservation track: applications of rapid ecoregional assessments (re as...
 
2016 asprs track: overview and user perspective of usgs 3 dep lidar by john ...
2016 asprs track:  overview and user perspective of usgs 3 dep lidar by john ...2016 asprs track:  overview and user perspective of usgs 3 dep lidar by john ...
2016 asprs track: overview and user perspective of usgs 3 dep lidar by john ...
 

Similar to Semantic search for Earth Observation products

Semantic search within Earth Observation products databases based on automati...
Semantic search within Earth Observation products databases based on automati...Semantic search within Earth Observation products databases based on automati...
Semantic search within Earth Observation products databases based on automati...Gasperi Jerome
 
Machine Learning, Key to Your Classification Challenges
Machine Learning, Key to Your Classification ChallengesMachine Learning, Key to Your Classification Challenges
Machine Learning, Key to Your Classification ChallengesMarc Borowczak
 
DataCamp Cheat Sheets 4 Python Users (2020)
DataCamp Cheat Sheets 4 Python Users (2020)DataCamp Cheat Sheets 4 Python Users (2020)
DataCamp Cheat Sheets 4 Python Users (2020)EMRE AKCAOGLU
 
Elasticsearch first-steps
Elasticsearch first-stepsElasticsearch first-steps
Elasticsearch first-stepsMatteo Moci
 
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...Michele Pasin
 
Crab: A Python Framework for Building Recommender Systems
Crab: A Python Framework for Building Recommender Systems Crab: A Python Framework for Building Recommender Systems
Crab: A Python Framework for Building Recommender Systems Marcel Caraciolo
 
RESTo - restful semantic search tool for geospatial
RESTo - restful semantic search tool for geospatialRESTo - restful semantic search tool for geospatial
RESTo - restful semantic search tool for geospatialGasperi Jerome
 
A Rusty introduction to Apache Arrow and how it applies to a time series dat...
A Rusty introduction to Apache Arrow and how it applies to a  time series dat...A Rusty introduction to Apache Arrow and how it applies to a  time series dat...
A Rusty introduction to Apache Arrow and how it applies to a time series dat...Andrew Lamb
 
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Lucidworks
 
Reflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemReflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemTrey Grainger
 
An Introduction to Data Mining with R
An Introduction to Data Mining with RAn Introduction to Data Mining with R
An Introduction to Data Mining with RYanchang Zhao
 
Improving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisImproving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisStuart Wrigley
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdfHODIT12
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1MostafaHazemMostafaa
 
Simple APIs and innovative documentation
Simple APIs and innovative documentationSimple APIs and innovative documentation
Simple APIs and innovative documentationPyDataParis
 
Mapreduce Algorithms
Mapreduce AlgorithmsMapreduce Algorithms
Mapreduce AlgorithmsAmund Tveit
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfhemangppatel
 
What's Coming Next in Sencha Frameworks
What's Coming Next in Sencha FrameworksWhat's Coming Next in Sencha Frameworks
What's Coming Next in Sencha FrameworksGrgur Grisogono
 
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...Sease
 

Similar to Semantic search for Earth Observation products (20)

Semantic search within Earth Observation products databases based on automati...
Semantic search within Earth Observation products databases based on automati...Semantic search within Earth Observation products databases based on automati...
Semantic search within Earth Observation products databases based on automati...
 
Machine Learning, Key to Your Classification Challenges
Machine Learning, Key to Your Classification ChallengesMachine Learning, Key to Your Classification Challenges
Machine Learning, Key to Your Classification Challenges
 
DataCamp Cheat Sheets 4 Python Users (2020)
DataCamp Cheat Sheets 4 Python Users (2020)DataCamp Cheat Sheets 4 Python Users (2020)
DataCamp Cheat Sheets 4 Python Users (2020)
 
Elasticsearch first-steps
Elasticsearch first-stepsElasticsearch first-steps
Elasticsearch first-steps
 
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
 
Crab: A Python Framework for Building Recommender Systems
Crab: A Python Framework for Building Recommender Systems Crab: A Python Framework for Building Recommender Systems
Crab: A Python Framework for Building Recommender Systems
 
RESTo - restful semantic search tool for geospatial
RESTo - restful semantic search tool for geospatialRESTo - restful semantic search tool for geospatial
RESTo - restful semantic search tool for geospatial
 
A Rusty introduction to Apache Arrow and how it applies to a time series dat...
A Rusty introduction to Apache Arrow and how it applies to a  time series dat...A Rusty introduction to Apache Arrow and how it applies to a  time series dat...
A Rusty introduction to Apache Arrow and how it applies to a time series dat...
 
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
 
Reflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemReflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data system
 
An Introduction to Data Mining with R
An Introduction to Data Mining with RAn Introduction to Data Mining with R
An Introduction to Data Mining with R
 
Improving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisImproving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log Analysis
 
Machine Learning ebook.pdf
Machine Learning ebook.pdfMachine Learning ebook.pdf
Machine Learning ebook.pdf
 
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 11_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
1_5_AI_edx_ml_51intro_240204_104838machine learning lecture 1
 
Simple APIs and innovative documentation
Simple APIs and innovative documentationSimple APIs and innovative documentation
Simple APIs and innovative documentation
 
Mapreduce Algorithms
Mapreduce AlgorithmsMapreduce Algorithms
Mapreduce Algorithms
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdf
 
What's Coming Next in Sencha Frameworks
What's Coming Next in Sencha FrameworksWhat's Coming Next in Sencha Frameworks
What's Coming Next in Sencha Frameworks
 
Decision Tree.pptx
Decision Tree.pptxDecision Tree.pptx
Decision Tree.pptx
 
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
Evaluating Your Learning to Rank Model: Dos and Don’ts in Offline/Online Eval...
 

More from Gasperi Jerome

Big data from space - Module Big Data ISAE 2017
Big data from space - Module Big Data ISAE 2017Big data from space - Module Big Data ISAE 2017
Big data from space - Module Big Data ISAE 2017Gasperi Jerome
 
Le Big Data et les données Copernicus
Le Big Data et les données CopernicusLe Big Data et les données Copernicus
Le Big Data et les données CopernicusGasperi Jerome
 
2016.02.18 big data from space toulouse data science
2016.02.18   big data from space    toulouse data science2016.02.18   big data from space    toulouse data science
2016.02.18 big data from space toulouse data scienceGasperi Jerome
 
2015.11.12 big data from space - cusi toulouse
2015.11.12   big data from space - cusi toulouse2015.11.12   big data from space - cusi toulouse
2015.11.12 big data from space - cusi toulouseGasperi Jerome
 
Big Data - Accès et traitement des données d’Observation de laTerre
Big Data - Accès et traitement des données d’Observation de laTerreBig Data - Accès et traitement des données d’Observation de laTerre
Big Data - Accès et traitement des données d’Observation de laTerreGasperi Jerome
 
2014.09.04 federated ground segments - toulouse
2014.09.04   federated ground segments - toulouse2014.09.04   federated ground segments - toulouse
2014.09.04 federated ground segments - toulouseGasperi Jerome
 
Web Processing Service
Web Processing ServiceWeb Processing Service
Web Processing ServiceGasperi Jerome
 
2014.04.22 - HyDre - Hydroweb Distribution Server
2014.04.22 - HyDre - Hydroweb Distribution Server2014.04.22 - HyDre - Hydroweb Distribution Server
2014.04.22 - HyDre - Hydroweb Distribution ServerGasperi Jerome
 
Single Sign On with OAuth and OpenID
Single Sign On with OAuth and OpenIDSingle Sign On with OAuth and OpenID
Single Sign On with OAuth and OpenIDGasperi Jerome
 
CNES OpenSearch implementations
CNES OpenSearch implementationsCNES OpenSearch implementations
CNES OpenSearch implementationsGasperi Jerome
 
Web Processing Service
Web Processing ServiceWeb Processing Service
Web Processing ServiceGasperi Jerome
 
Unify Earth Observation products access with OpenSearch
Unify Earth Observation products access with OpenSearchUnify Earth Observation products access with OpenSearch
Unify Earth Observation products access with OpenSearchGasperi Jerome
 
CNES activities on semantic search
CNES activities on semantic searchCNES activities on semantic search
CNES activities on semantic searchGasperi Jerome
 
Traitements de données à la demande - Introduction au Web Processing Service
Traitements de données à la demande - Introduction au Web Processing ServiceTraitements de données à la demande - Introduction au Web Processing Service
Traitements de données à la demande - Introduction au Web Processing ServiceGasperi Jerome
 
Data access and data extraction services within the Land Imagery Portal
Data access and data extraction services within the Land Imagery PortalData access and data extraction services within the Land Imagery Portal
Data access and data extraction services within the Land Imagery PortalGasperi Jerome
 
Semantic search applied to Earth Observation products
Semantic search applied to Earth Observation productsSemantic search applied to Earth Observation products
Semantic search applied to Earth Observation productsGasperi Jerome
 
Accès à l’information satellitaire dans un contexte réactif de catastrophe na...
Accès à l’information satellitaire dans un contexte réactif de catastrophe na...Accès à l’information satellitaire dans un contexte réactif de catastrophe na...
Accès à l’information satellitaire dans un contexte réactif de catastrophe na...Gasperi Jerome
 
Experimenting a cloud based solution for image processing and data access
Experimenting a cloud based solution for image processing and data accessExperimenting a cloud based solution for image processing and data access
Experimenting a cloud based solution for image processing and data accessGasperi Jerome
 
Interoperability and value added to earth observation data - 2011.11.24
Interoperability and value added to earth observation data - 2011.11.24Interoperability and value added to earth observation data - 2011.11.24
Interoperability and value added to earth observation data - 2011.11.24Gasperi Jerome
 

More from Gasperi Jerome (20)

Big data from space - Module Big Data ISAE 2017
Big data from space - Module Big Data ISAE 2017Big data from space - Module Big Data ISAE 2017
Big data from space - Module Big Data ISAE 2017
 
Le Big Data et les données Copernicus
Le Big Data et les données CopernicusLe Big Data et les données Copernicus
Le Big Data et les données Copernicus
 
2016.02.18 big data from space toulouse data science
2016.02.18   big data from space    toulouse data science2016.02.18   big data from space    toulouse data science
2016.02.18 big data from space toulouse data science
 
2015.11.12 big data from space - cusi toulouse
2015.11.12   big data from space - cusi toulouse2015.11.12   big data from space - cusi toulouse
2015.11.12 big data from space - cusi toulouse
 
Big Data - Accès et traitement des données d’Observation de laTerre
Big Data - Accès et traitement des données d’Observation de laTerreBig Data - Accès et traitement des données d’Observation de laTerre
Big Data - Accès et traitement des données d’Observation de laTerre
 
2014.09.04 federated ground segments - toulouse
2014.09.04   federated ground segments - toulouse2014.09.04   federated ground segments - toulouse
2014.09.04 federated ground segments - toulouse
 
Web Processing Service
Web Processing ServiceWeb Processing Service
Web Processing Service
 
2014.04.22 - HyDre - Hydroweb Distribution Server
2014.04.22 - HyDre - Hydroweb Distribution Server2014.04.22 - HyDre - Hydroweb Distribution Server
2014.04.22 - HyDre - Hydroweb Distribution Server
 
Single Sign On with OAuth and OpenID
Single Sign On with OAuth and OpenIDSingle Sign On with OAuth and OpenID
Single Sign On with OAuth and OpenID
 
CNES Data Center
CNES Data CenterCNES Data Center
CNES Data Center
 
CNES OpenSearch implementations
CNES OpenSearch implementationsCNES OpenSearch implementations
CNES OpenSearch implementations
 
Web Processing Service
Web Processing ServiceWeb Processing Service
Web Processing Service
 
Unify Earth Observation products access with OpenSearch
Unify Earth Observation products access with OpenSearchUnify Earth Observation products access with OpenSearch
Unify Earth Observation products access with OpenSearch
 
CNES activities on semantic search
CNES activities on semantic searchCNES activities on semantic search
CNES activities on semantic search
 
Traitements de données à la demande - Introduction au Web Processing Service
Traitements de données à la demande - Introduction au Web Processing ServiceTraitements de données à la demande - Introduction au Web Processing Service
Traitements de données à la demande - Introduction au Web Processing Service
 
Data access and data extraction services within the Land Imagery Portal
Data access and data extraction services within the Land Imagery PortalData access and data extraction services within the Land Imagery Portal
Data access and data extraction services within the Land Imagery Portal
 
Semantic search applied to Earth Observation products
Semantic search applied to Earth Observation productsSemantic search applied to Earth Observation products
Semantic search applied to Earth Observation products
 
Accès à l’information satellitaire dans un contexte réactif de catastrophe na...
Accès à l’information satellitaire dans un contexte réactif de catastrophe na...Accès à l’information satellitaire dans un contexte réactif de catastrophe na...
Accès à l’information satellitaire dans un contexte réactif de catastrophe na...
 
Experimenting a cloud based solution for image processing and data access
Experimenting a cloud based solution for image processing and data accessExperimenting a cloud based solution for image processing and data access
Experimenting a cloud based solution for image processing and data access
 
Interoperability and value added to earth observation data - 2011.11.24
Interoperability and value added to earth observation data - 2011.11.24Interoperability and value added to earth observation data - 2011.11.24
Interoperability and value added to earth observation data - 2011.11.24
 

Recently uploaded

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 

Recently uploaded (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 

Semantic search for Earth Observation products

  • 1. Jérôme Gasperi WGISS #37
 Cocoa Beach, Florida - USA - April 16th, 2014 Semantic search
  • 2. Semantic search helps users to find the right data
  • 3. Semantic search helps users to find the right data
  • 4. How to add semantics capabilities to EO products search services ?
  • 5.
  • 6. Characterize products with relevant information. Think « users », not « experts » 1
  • 7. Characterize products with relevant information. Think « users », not « experts » 1 Decode users natural language queries 2
  • 8.
  • 9. Use footprint to enrich metadata from exogenous data 1 Enrich products github.com/jjrom/itag
  • 10. ! Tag this footprint with continent, country and Land use ! http://goo.gl/WtbcbR
 iTag1 Enrich products
  • 11.
  • 12. 2 Decode queries RESTo provides semantic search capabilities It uses a Query Analyzer to translate query into a set of EO OpenSearch parameters
  • 13. Query string analysis algorithm is based on simple recognition of words and patterns Split query string into list of unitary words Extract «key=value» strings e.g. orbitNumber=4 Extract Platforms and Instruments Platforms and instruments list are stored within common dictionary !https://github.com/jjrom/resto/blob/ master/resto/dictionaries/common.php Remove excluded words and non dictionary words with length < 4 characters e.g. «area of Mexico in 2012» Extract patterns and dates e.g. «acquired in the last 2 days» Extract keywords e.g. «urban area in France» Extract location on remaining words e.g. «images acquired in Toulouse» 2 Decode queries
  • 14. Recognized patterns <with> "keyword" <without> "keyword" ! "quantity" <lesser> (than) "numeric" "unit" "quantity" <greater> (than) "numeric" "unit" "quantity" <equal> (to) "numeric" "unit" <lesser> (than) "numeric" "unit" (of) "quantity" <greater> (than) "numeric" "unit" (of) "quantity" <equal> (to) "numeric" "unit" (of) "quantity" "quantity" <between> "numeric" <and> "numeric" ("unit") <between> "numeric" <and> "numeric" "unit" (of) "quantity" ! <today> <yesterday> <before> "date" <after> "date" <between> "date" <and> "date" "numeric" "(year|day|month)" <ago> <last> "(year|day|month)" <last> "numeric" "(year|day|month)" "numeric" <last> "(year|day|month)" "(year|day|month)" <last> <since> "numeric" "(year|day|month)" <since> "month" "year" <since> "date" <since> "numeric" <last> "(year|day|month)" <since> <last> "numeric" "(year|day|month)" <since> <last> "(year|day|month)" <since> "(year|day|month)" <last> 2 Decode queries
  • 15. $dictionary = array( 'excluded' => array( 'than', 'image',
 ... ), 'modifiers' => array( 'ago' => 'ago', 'before' => 'before', 'after' => 'after', ... ), 'units' => array( 'm' => 'm', 'meter' => 'm',
 'days' => 'days',
 ... ), 'numbers' => array( 'one' => '1', ... ), 'months' => array( 'january' => '01', ... ), 'quantities' => array( 'resolution' => 'resolution', ... ), 'keywords' => array( 'continent' => array( 'europe' => 'europe', ... ) ) Words are stored within a dictionary 2 Decode queries
  • 16. « Images of urban area in the US acquired in the last 10 days with less than 5 % of cloud cover » Example 2 Decode queries
  • 17. 2 Decode queries « Images of urban area in the US acquired in the last 10 days with less than 5 % of cloud cover » Example keyword location date acquisition parameter
  • 18. 2. Each search result has an « human readable url » that can be indexed by web crawler (i.e. google robots) 1. Search parameters are derived from Natural Language query 3. Keywords on resources are links to search requests : they can be indexed by web crawler…and so on Search (example)
  • 19. 2. Each search result has an « human readable url » that can be indexed by web crawler (i.e. google robots) 1. Search parameters are derived from Natural Language query 3. Keywords on resources are links to search requests : they can be indexed by web crawler…and so on Search (example) http://goo.gl/GvMEHj
  • 21.
  • 22. Earthquakes in november 2008 in china
  • 23. Earthquakes in november 2008 in china Ambiguous since it appears to be a location in New Zealand
  • 24. « Linked data is the right way to do Semantic Web » Tim Berners-Lee
  • 25. Update RESTo JSON model to follow JSON-LD format { "@context": "http://json-ld.org/contexts/person.jsonld", "@id": "http://dbpedia.org/resource/John_Lennon", "name": "John Lennon", "born": "1940-10-09", "spouse": "http://dbpedia.org/resource/Cynthia_Lennon" }