SlideShare a Scribd company logo
1 of 38
Towards a integrated network of data
and services for the life sciences
1
Michel Dumontier, Ph.D.
Associate Professor of Bioinformatics
Carleton University
Department of Biology
School of Computer Science
Institute of Biochemistry
Ottawa Institute of Systems Biology
Ottawa-Carleton Institute of Biomedical Engineering
Finding the right information to answer a question is hard
and sometimes requires a sophisticated workflow
2
What if we could answer a question
by automatically building a knowledge base
using both data and services?4
The Semantic Web is a web of knowledge.
5
It is about standards for publishing, sharing and querying
knowledge drawn from diverse sources
It enables the answering of
sophisticated questions
Is caffeine a drug-like molecule?
To answer this question we need to know:
• what ‘drug like molecule’ really
means
• caffeine’s molecular structure
• use the structural information to
compute the attributes
• determine whether caffeine
satisfies the requirements of being
‘drug like’
Is caffeine a drug-like molecule?
Lipinski Rule of Five
• Rule of thumb for druglikeness (orally active in humans)
(4 rules with multiples of 5)
– mass of less than 500 Daltons
– fewer than 5 hydrogen bond donors
– fewer than 10 hydrogen bond acceptors
– A partition coefficient value between -5 and 5
We need a more formal (machine understandable) description
of a ‘drug-like molecule’ which specifies values for chemical
descriptors
ontology as a
strategy to
formally represent
knowledge
9
The Web Ontology Language (OWL) Has
Explicit Semantics
Can therefore be used to capture knowledge in a
machine understandable way
10
The Chemical Information Ontology
(CHEMINF)
• 100+ chemical descriptors
• 50+ chemical qualities
• Relates descriptors to their
specifications, the software that
generated them (along with the running
parameters, and the algorithms that they
implement)
• Contributors: Nico Adams, Leonid Chepelev,
Michel Dumontier, Janna Hastings, Egon
Willighagen, Peter Murray-Rust, Cristoph
Steinbeck
11
http://semanticchemistry.googlecode.com
Molecular structure can be represented using a
SMILES string, which is a common representation
of the chemical graph
12
ball & stick model for
caffeine
SMILES string
for caffeine
Cn1cnc2n(C)c(=O)n(C)c(=O)c12
Lipinski Rule of Five
• Empirically derived ruleset for druglikeness
(4 rules with multiples of 5)
– mass of less than 500 Daltons
– fewer than 5 hydrogen bond donors
– fewer than 10 hydrogen bond acceptors
– A partition coefficient value between -5 and 5
• A formal description using OWL:
What we then need are services that will consume SMILES
strings and annotate the molecule with the required chemical
descriptors
14
then we can reason
about whether it
satisfies the drug-
likeness definition
Semantic Automated
Discovery and Integration
http://sadiframework.org
Mark Wilkinson, UBC
Michel Dumontier, Carleton University
Christopher Baker, UNB
SADI is a framework to create Semantic Web services using OWL
classes as service inputs and outputs
15
SADI
• OWL classes in SADI are local to individual
services
– They should uniquely specify the service input and
outputs (they exactly have the right restrictions)
– one service’s world-view can conflict with another,
but a client can use any or all
• maximize interoperability by reusing types
and relations
Semanticscience Integrated Ontology
(SIO)
• OWL2 ontology
• 800 classes covering basic types (physical, processual,
informational) with an emphasis on biological entities
• 129 basic relations (mereological, participatory,
attribute/quality, spatial, temporal and representational)
• axioms can be used by reasoners to generate inferences
for consistency checking, classification and answering
questions about life science knowledge
• embodies emerging ontology design patterns
• dereferenceable URIs
• searchable in the NCBO bioportal
http://semanticscience.org/ontology/sio.owl
17 CASCON: Nov 3, 2010
Create code stubs using the ontology
• Publish the ontology to a web-accessible location
http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl
• Make sure that the class names are resolvable
(easy when using the hash notation)
http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#smiles-molecule
http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#logp-molecule
http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#hbdc-molecule
http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#hdba-molecule
http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#lipinksi-druglike-molecule
• Download/checkout the code
http://sadiframework.org
• Run the code generator
– specify the URIs that correspond to input and output types
18
Implement the functionality
• Java version
– Uses Jena to manipulate the RDF graph
– Uses Maven to build from command-line or Eclipse; Invokes Jetty for
service testing
• Chemistry
– We used the Chemistry Development Kit (CDK) to implement 4
services
19
Responds to a GET operation by providing
the service description in RDF
conforms to Feta (BioMoby, myGrid)
20
curl http://cbrass.biordf.net/logpdc/logpc
<rdf:RDF
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:j.0="http://www.mygrid.org.uk/mygrid-moby-service#" >
<rdf:Description rdf:about="">
<j.0:hasServiceDescriptionText>no description</j.0:hasServiceDescriptionText>
<j.0:hasServiceNameText rdf:datatype="http://www.w3.org/2001/XMLSchema#string">logpc</j.0:hasServiceNameText>
<j.0:hasOperation rdf:resource="#operation"/>
<rdf:type rdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#serviceDescription"/>
</rdf:Description>
<rdf:Description rdf:about="#input">
<j.0:objectType rdf:resource="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#smilesmolecule"/>
<rdf:type rdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#parameter"/>
</rdf:Description>
<rdf:Description rdf:about="#operation">
<j.0:outputParameter rdf:resource="#output"/>
<j.0:inputParameter rdf:resource="#input"/>
<rdf:type rdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#operation"/>
</rdf:Description>
<rdf:Description rdf:about="#output">
<j.0:objectType rdf:resource="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#alogpsmilesmolecule"/>
<rdf:type rdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#parameter"/>
</rdf:Description>
</rdf:RDF>
Responds to a POST containing service
input with a service output in RDF
21
<rdf:Description rdf:about="http://semanticscience.org/sadi/ontology/caffeine.rdf#mdalogp">
<rdf:type rdf:resource="http://semanticscience.org/resource/CHEMINF_000251"/>
<j.0:SIO_000300 rdf:datatype="http://www.w3.org/2001/XMLSchema#double">-0.4311000000000006</j.0:SIO_000300>
</rdf:Description>
<rdf:RDF xmlns="http://semanticscience.org/sadi/ontology/caffeine.rdf#"
xmlns:so="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#"
xmlns:owl="http://www.w3.org/2002/07/owl#"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:sio="http://semanticscience.org/resource/"
xmlns:xsd="http://www.w3.org/2001/XMLSchema#">
<so:smilesmolecule rdf:about="http://semanticscience.org/sadi/ontology/caffeine.rdf#m">
<sio:SIO_000008 rdf:resource = "http://semanticscience.org/sadi/ontology/caffeine.rdf#msmiles"/>
</so:smilesmolecule>
<sio:CHEMINF_000018 rdf:about = "http://semanticscience.org/sadi/ontology/caffeine.rdf#msmiles">
<sio:SIO_000300 rdf:datatype="xsd:string">Cn1cnc2n(C)c(=O)n(C)c(=O)c12</sio:SIO_000300>
</sio:CHEMINF_000018>
</rdf:RDF>
curl --data @caffeine.rdf http://cbrass.biordf.net/logpdc/logpc
Now what?
22
23
Semantic Health and Research Environment
SHARE is an application that execute (SPARQL) queries as workflows
over SADI Services
“Reckoning”
dynamic discovery of instances of OWL classes
through synthesis and invocation of a Web Service
workflow capable of generating data described by
the OWL class restrictions, followed by reasoning
to classify the data into that ontology
24
ChEBI has (non-SW) data!
25
Bio2RDF provides ChEBI in RDF 
26
Bio2RDF is now serving over
40 billion triples of linked biological data
27
Bio2RDF covers the major biological
databases
28
Bio2RDF is part of a growing web of linked data
29 “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
something you can lookup or
search for with rich descriptions
30
31
SPARQL is the new cool kid on the query block
SQL SPARQL
Query for log p
32
33
Query:
Is caffeine a drug-like molecule?
34
Benefits
• Data remains distributed – as the internet was
meant to be!
• Data is not “exposed” as a SPARQL endpoint
– greater provider-control over computational
resources
• Service invocation is straightforward and
matchmaking by reasoning about ontology-based
input/output descriptions
35
Summary
• Semantic Web technologies offer tantalizing
new opportunities to publish, share and query
data and services
• Bio2RDF provides linked life science data
• SADI provides a framework to provide
semantic web services
• SHARE allows us to simultaneously query and
reason about data and services represented
using RDF/OWL
36 CASCON: Nov 3, 2010
37
Acknowledgements
This research is supported by The Heart + Stroke Foundation of BC and Yukon, Microsoft Research,
The Canadian Institutes of Health Research, The Natural Sciences and Engineering Research Council of Canada and CANARIE.
Marc-Alexandre Nolin & Francois Belleau (Bio2RDF)
Leo Chepelev (implementing the services)
Luke McCarthy (SADI technical support)
Mark Wilkinson (vision and leadership)
Chris Baker (lipidomics)
CHEMINF Group
Leo Chepelev
Janna Hastings
Egon Willighagen
Nico Adams
dumontierlab.com
michel_dumontier@carleton.ca
38

More Related Content

What's hot

FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...Carole Goble
 
HyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologiesHyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologiesMichel Dumontier
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and modelsmyGrid team
 
Light Intro to the Gene Ontology
Light Intro to the Gene OntologyLight Intro to the Gene Ontology
Light Intro to the Gene Ontologynniiicc
 
USUGM 2014 - Gregory Landrum (Novartis): What else can you do with the Marku...
USUGM 2014 -  Gregory Landrum (Novartis): What else can you do with the Marku...USUGM 2014 -  Gregory Landrum (Novartis): What else can you do with the Marku...
USUGM 2014 - Gregory Landrum (Novartis): What else can you do with the Marku...ChemAxon
 
Publishing data and code openly
Publishing data and code openlyPublishing data and code openly
Publishing data and code openlyFAIRDOM
 
US2TS presentation on Gene Ontology
US2TS presentation on Gene OntologyUS2TS presentation on Gene Ontology
US2TS presentation on Gene OntologyChris Mungall
 
Publication of raw and curated NMR spectroscopic data for organic molecules
Publication of raw and curated NMR spectroscopic data for organic moleculesPublication of raw and curated NMR spectroscopic data for organic molecules
Publication of raw and curated NMR spectroscopic data for organic moleculesChristoph Steinbeck
 
Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019Chris Mungall
 
Tools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databasesTools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databasesValery Tkachenko
 

What's hot (20)

Whitney Symposium Lecture June 2008
Whitney Symposium Lecture June 2008Whitney Symposium Lecture June 2008
Whitney Symposium Lecture June 2008
 
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
FAIRDOM - FAIR Asset management and sharing experiences in Systems and Synthe...
 
DisGeNET Tutorial SWAT4LS 2015-12-07
DisGeNET Tutorial SWAT4LS 2015-12-07DisGeNET Tutorial SWAT4LS 2015-12-07
DisGeNET Tutorial SWAT4LS 2015-12-07
 
HyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologiesHyQue: Evaluating scientific Hypotheses using semantic web technologies
HyQue: Evaluating scientific Hypotheses using semantic web technologies
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and models
 
Light Intro to the Gene Ontology
Light Intro to the Gene OntologyLight Intro to the Gene Ontology
Light Intro to the Gene Ontology
 
Phylogenetics: Making publication-quality tree figures
Phylogenetics: Making publication-quality tree figuresPhylogenetics: Making publication-quality tree figures
Phylogenetics: Making publication-quality tree figures
 
USUGM 2014 - Gregory Landrum (Novartis): What else can you do with the Marku...
USUGM 2014 -  Gregory Landrum (Novartis): What else can you do with the Marku...USUGM 2014 -  Gregory Landrum (Novartis): What else can you do with the Marku...
USUGM 2014 - Gregory Landrum (Novartis): What else can you do with the Marku...
 
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
RSC ChemSpider -- Managing and Integrating Chemistry on the Internet to Build...
 
Open innovation contributions from RSC resulting from the Open Phacts project
Open innovation contributions from RSC resulting from the Open Phacts projectOpen innovation contributions from RSC resulting from the Open Phacts project
Open innovation contributions from RSC resulting from the Open Phacts project
 
Ibn Sina
Ibn SinaIbn Sina
Ibn Sina
 
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
Delivering Curated Chemistry to the World via Crowdsourced Deposition and Ann...
 
Publishing data and code openly
Publishing data and code openlyPublishing data and code openly
Publishing data and code openly
 
Sybrandt Thesis Proposal Presentation
Sybrandt Thesis Proposal PresentationSybrandt Thesis Proposal Presentation
Sybrandt Thesis Proposal Presentation
 
US2TS presentation on Gene Ontology
US2TS presentation on Gene OntologyUS2TS presentation on Gene Ontology
US2TS presentation on Gene Ontology
 
Knowledge Beacons
Knowledge BeaconsKnowledge Beacons
Knowledge Beacons
 
Publication of raw and curated NMR spectroscopic data for organic molecules
Publication of raw and curated NMR spectroscopic data for organic moleculesPublication of raw and curated NMR spectroscopic data for organic molecules
Publication of raw and curated NMR spectroscopic data for organic molecules
 
Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019Ontology Development Kit: Bio-Ontologies 2019
Ontology Development Kit: Bio-Ontologies 2019
 
Tools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databasesTools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databases
 
RSC ChemSpider is the online chemistry database where community contributions...
RSC ChemSpider is the online chemistry database where community contributions...RSC ChemSpider is the online chemistry database where community contributions...
RSC ChemSpider is the online chemistry database where community contributions...
 

Viewers also liked

Konsep Presentasi Portal Berita
Konsep Presentasi Portal BeritaKonsep Presentasi Portal Berita
Konsep Presentasi Portal BeritaGameloft
 
jQtouch, Building Awesome Webapps
jQtouch, Building Awesome WebappsjQtouch, Building Awesome Webapps
jQtouch, Building Awesome WebappsHome
 
I Dream of Justice, I Dream of Change
I Dream of Justice, I Dream of ChangeI Dream of Justice, I Dream of Change
I Dream of Justice, I Dream of Changeewingj33
 
The Economics of Grid-Connected Hybrid Distributed Generation
The Economics of Grid-Connected Hybrid Distributed GenerationThe Economics of Grid-Connected Hybrid Distributed Generation
The Economics of Grid-Connected Hybrid Distributed GenerationIain Sanders
 
IVI Presentation At Rusnano Conference
IVI Presentation At Rusnano ConferenceIVI Presentation At Rusnano Conference
IVI Presentation At Rusnano ConferenceThomas Nastas
 
U of L and The Social Web
U of L and The Social WebU of L and The Social Web
U of L and The Social Webjackbr4
 
Socialmediastenden
SocialmediastendenSocialmediastenden
SocialmediastendenSascha Funk
 
Plenaire Presentatie Jan Bommerez
Plenaire Presentatie Jan BommerezPlenaire Presentatie Jan Bommerez
Plenaire Presentatie Jan BommerezPatrick Terranea
 
Wastewater 101: Decentralized approach, community process, and options
Wastewater 101: Decentralized approach, community process, and optionsWastewater 101: Decentralized approach, community process, and options
Wastewater 101: Decentralized approach, community process, and optionsdmalchow
 
IVI Workshop Value Creation in European Pe
IVI Workshop Value Creation in European PeIVI Workshop Value Creation in European Pe
IVI Workshop Value Creation in European PeThomas Nastas
 
Connect Globally For An Innovation Economy, Nastas Article In Moscow Times
Connect Globally For An Innovation Economy, Nastas Article In Moscow TimesConnect Globally For An Innovation Economy, Nastas Article In Moscow Times
Connect Globally For An Innovation Economy, Nastas Article In Moscow TimesThomas Nastas
 
Real World Applications of OWL
Real World Applications of OWLReal World Applications of OWL
Real World Applications of OWLMichel Dumontier
 
clusterstor-hadoop-data-sheet
clusterstor-hadoop-data-sheetclusterstor-hadoop-data-sheet
clusterstor-hadoop-data-sheetAndrei Khurshudov
 

Viewers also liked (20)

Burlata
BurlataBurlata
Burlata
 
Konsep Presentasi Portal Berita
Konsep Presentasi Portal BeritaKonsep Presentasi Portal Berita
Konsep Presentasi Portal Berita
 
jQtouch, Building Awesome Webapps
jQtouch, Building Awesome WebappsjQtouch, Building Awesome Webapps
jQtouch, Building Awesome Webapps
 
IPCC2010-1
IPCC2010-1IPCC2010-1
IPCC2010-1
 
I Dream of Justice, I Dream of Change
I Dream of Justice, I Dream of ChangeI Dream of Justice, I Dream of Change
I Dream of Justice, I Dream of Change
 
The Economics of Grid-Connected Hybrid Distributed Generation
The Economics of Grid-Connected Hybrid Distributed GenerationThe Economics of Grid-Connected Hybrid Distributed Generation
The Economics of Grid-Connected Hybrid Distributed Generation
 
Connexions Roy 2013
Connexions Roy 2013Connexions Roy 2013
Connexions Roy 2013
 
IVI Presentation At Rusnano Conference
IVI Presentation At Rusnano ConferenceIVI Presentation At Rusnano Conference
IVI Presentation At Rusnano Conference
 
U of L and The Social Web
U of L and The Social WebU of L and The Social Web
U of L and The Social Web
 
Socialmediastenden
SocialmediastendenSocialmediastenden
Socialmediastenden
 
Plenaire Presentatie Jan Bommerez
Plenaire Presentatie Jan BommerezPlenaire Presentatie Jan Bommerez
Plenaire Presentatie Jan Bommerez
 
Wastewater 101: Decentralized approach, community process, and options
Wastewater 101: Decentralized approach, community process, and optionsWastewater 101: Decentralized approach, community process, and options
Wastewater 101: Decentralized approach, community process, and options
 
IVI Workshop Value Creation in European Pe
IVI Workshop Value Creation in European PeIVI Workshop Value Creation in European Pe
IVI Workshop Value Creation in European Pe
 
Connect Globally For An Innovation Economy, Nastas Article In Moscow Times
Connect Globally For An Innovation Economy, Nastas Article In Moscow TimesConnect Globally For An Innovation Economy, Nastas Article In Moscow Times
Connect Globally For An Innovation Economy, Nastas Article In Moscow Times
 
Monaco201209
Monaco201209Monaco201209
Monaco201209
 
Real World Applications of OWL
Real World Applications of OWLReal World Applications of OWL
Real World Applications of OWL
 
Cda esm waste oil disposal application part 2
Cda esm waste oil disposal application part 2Cda esm waste oil disposal application part 2
Cda esm waste oil disposal application part 2
 
clusterstor-hadoop-data-sheet
clusterstor-hadoop-data-sheetclusterstor-hadoop-data-sheet
clusterstor-hadoop-data-sheet
 
HR head dilemma ideate assignment
HR head dilemma ideate assignmentHR head dilemma ideate assignment
HR head dilemma ideate assignment
 
Social Media Summit
Social Media SummitSocial Media Summit
Social Media Summit
 

Similar to 2010 CASCON - Towards a integrated network of data and services for the life sciences

Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...William Gunn
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003robertstevens65
 
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSImplementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSValery Tkachenko
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Michel Dumontier
 
Online Chemical Database with Modelling Environment
Online Chemical Database with Modelling EnvironmentOnline Chemical Database with Modelling Environment
Online Chemical Database with Modelling EnvironmentSSA KPI
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsSean Ekins
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Amit Sheth
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesConnected Data World
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyMelanie Courtot
 
MADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxMADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxChris Mungall
 
Enhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataEnhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataBarry Smith
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchEuropean Bioinformatics Institute
 
ICIC 2013 Conference Proceedings Antony Williams Royal Society of Chemistry
ICIC 2013 Conference Proceedings Antony Williams Royal Society of ChemistryICIC 2013 Conference Proceedings Antony Williams Royal Society of Chemistry
ICIC 2013 Conference Proceedings Antony Williams Royal Society of ChemistryDr. Haxel Consult
 

Similar to 2010 CASCON - Towards a integrated network of data and services for the life sciences (20)

Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
Facilitating Scientific Discovery through Crowdsourcing and Distributed Parti...
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003Beyond Transparency: Success & Lessons From tambisBoston2003
Beyond Transparency: Success & Lessons From tambisBoston2003
 
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSImplementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
 
Serving the medicinal chemistry community with Royal Society of Chemistry che...
Serving the medicinal chemistry community with Royal Society of Chemistry che...Serving the medicinal chemistry community with Royal Society of Chemistry che...
Serving the medicinal chemistry community with Royal Society of Chemistry che...
 
Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards Accessing Environmental Chemistry Data via Data Dashboards
Accessing Environmental Chemistry Data via Data Dashboards
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 
Online Chemical Database with Modelling Environment
Online Chemical Database with Modelling EnvironmentOnline Chemical Database with Modelling Environment
Online Chemical Database with Modelling Environment
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
 
Applications of the US EPA’s CompTox chemicals dashboard to support structure...
Applications of the US EPA’s CompTox chemicals dashboard to support structure...Applications of the US EPA’s CompTox chemicals dashboard to support structure...
Applications of the US EPA’s CompTox chemicals dashboard to support structure...
 
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
Semantic Web & Web 3.0 empowering real world outcomes in biomedical research ...
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontology
 
BioNLPSADI
BioNLPSADIBioNLPSADI
BioNLPSADI
 
MADICES Mungall 2022.pptx
MADICES Mungall 2022.pptxMADICES Mungall 2022.pptx
MADICES Mungall 2022.pptx
 
Enhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort DataEnhancing the Quality of ImmPort Data
Enhancing the Quality of ImmPort Data
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
ICIC 2013 Conference Proceedings Antony Williams Royal Society of Chemistry
ICIC 2013 Conference Proceedings Antony Williams Royal Society of ChemistryICIC 2013 Conference Proceedings Antony Williams Royal Society of Chemistry
ICIC 2013 Conference Proceedings Antony Williams Royal Society of Chemistry
 
Big data challenges associated with building a national data repository for c...
Big data challenges associated with building a national data repository for c...Big data challenges associated with building a national data repository for c...
Big data challenges associated with building a national data repository for c...
 
Neo4j and bioinformatics
Neo4j and bioinformaticsNeo4j and bioinformatics
Neo4j and bioinformatics
 

More from Michel Dumontier

A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsMichel Dumontier
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsMichel Dumontier
 
The Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemThe Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemMichel Dumontier
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...Michel Dumontier
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemMichel Dumontier
 
Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Michel Dumontier
 
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Michel Dumontier
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Michel Dumontier
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...Michel Dumontier
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerMichel Dumontier
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureMichel Dumontier
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesMichel Dumontier
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRMichel Dumontier
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsMichel Dumontier
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationMichel Dumontier
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessMichel Dumontier
 

More from Michel Dumontier (20)

A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge Graphs
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
 
Evaluating FAIRness
Evaluating FAIRnessEvaluating FAIRness
Evaluating FAIRness
 
The Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemThe Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health System
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health System
 
Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...
 
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University Dinner
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star Lecture
 
Are we FAIR yet?
Are we FAIR yet?Are we FAIR yet?
Are we FAIR yet?
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resources
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIR
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR Metrics
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
 

Recently uploaded

The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxAnaBeatriceAblay2
 

Recently uploaded (20)

The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 

2010 CASCON - Towards a integrated network of data and services for the life sciences

  • 1. Towards a integrated network of data and services for the life sciences 1 Michel Dumontier, Ph.D. Associate Professor of Bioinformatics Carleton University Department of Biology School of Computer Science Institute of Biochemistry Ottawa Institute of Systems Biology Ottawa-Carleton Institute of Biomedical Engineering
  • 2. Finding the right information to answer a question is hard and sometimes requires a sophisticated workflow 2
  • 3.
  • 4. What if we could answer a question by automatically building a knowledge base using both data and services?4
  • 5. The Semantic Web is a web of knowledge. 5 It is about standards for publishing, sharing and querying knowledge drawn from diverse sources It enables the answering of sophisticated questions
  • 6. Is caffeine a drug-like molecule?
  • 7. To answer this question we need to know: • what ‘drug like molecule’ really means • caffeine’s molecular structure • use the structural information to compute the attributes • determine whether caffeine satisfies the requirements of being ‘drug like’ Is caffeine a drug-like molecule?
  • 8. Lipinski Rule of Five • Rule of thumb for druglikeness (orally active in humans) (4 rules with multiples of 5) – mass of less than 500 Daltons – fewer than 5 hydrogen bond donors – fewer than 10 hydrogen bond acceptors – A partition coefficient value between -5 and 5 We need a more formal (machine understandable) description of a ‘drug-like molecule’ which specifies values for chemical descriptors
  • 9. ontology as a strategy to formally represent knowledge 9
  • 10. The Web Ontology Language (OWL) Has Explicit Semantics Can therefore be used to capture knowledge in a machine understandable way 10
  • 11. The Chemical Information Ontology (CHEMINF) • 100+ chemical descriptors • 50+ chemical qualities • Relates descriptors to their specifications, the software that generated them (along with the running parameters, and the algorithms that they implement) • Contributors: Nico Adams, Leonid Chepelev, Michel Dumontier, Janna Hastings, Egon Willighagen, Peter Murray-Rust, Cristoph Steinbeck 11 http://semanticchemistry.googlecode.com
  • 12. Molecular structure can be represented using a SMILES string, which is a common representation of the chemical graph 12 ball & stick model for caffeine SMILES string for caffeine Cn1cnc2n(C)c(=O)n(C)c(=O)c12
  • 13. Lipinski Rule of Five • Empirically derived ruleset for druglikeness (4 rules with multiples of 5) – mass of less than 500 Daltons – fewer than 5 hydrogen bond donors – fewer than 10 hydrogen bond acceptors – A partition coefficient value between -5 and 5 • A formal description using OWL:
  • 14. What we then need are services that will consume SMILES strings and annotate the molecule with the required chemical descriptors 14 then we can reason about whether it satisfies the drug- likeness definition
  • 15. Semantic Automated Discovery and Integration http://sadiframework.org Mark Wilkinson, UBC Michel Dumontier, Carleton University Christopher Baker, UNB SADI is a framework to create Semantic Web services using OWL classes as service inputs and outputs 15
  • 16. SADI • OWL classes in SADI are local to individual services – They should uniquely specify the service input and outputs (they exactly have the right restrictions) – one service’s world-view can conflict with another, but a client can use any or all • maximize interoperability by reusing types and relations
  • 17. Semanticscience Integrated Ontology (SIO) • OWL2 ontology • 800 classes covering basic types (physical, processual, informational) with an emphasis on biological entities • 129 basic relations (mereological, participatory, attribute/quality, spatial, temporal and representational) • axioms can be used by reasoners to generate inferences for consistency checking, classification and answering questions about life science knowledge • embodies emerging ontology design patterns • dereferenceable URIs • searchable in the NCBO bioportal http://semanticscience.org/ontology/sio.owl 17 CASCON: Nov 3, 2010
  • 18. Create code stubs using the ontology • Publish the ontology to a web-accessible location http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl • Make sure that the class names are resolvable (easy when using the hash notation) http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#smiles-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#logp-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#hbdc-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#hdba-molecule http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#lipinksi-druglike-molecule • Download/checkout the code http://sadiframework.org • Run the code generator – specify the URIs that correspond to input and output types 18
  • 19. Implement the functionality • Java version – Uses Jena to manipulate the RDF graph – Uses Maven to build from command-line or Eclipse; Invokes Jetty for service testing • Chemistry – We used the Chemistry Development Kit (CDK) to implement 4 services 19
  • 20. Responds to a GET operation by providing the service description in RDF conforms to Feta (BioMoby, myGrid) 20 curl http://cbrass.biordf.net/logpdc/logpc <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:j.0="http://www.mygrid.org.uk/mygrid-moby-service#" > <rdf:Description rdf:about=""> <j.0:hasServiceDescriptionText>no description</j.0:hasServiceDescriptionText> <j.0:hasServiceNameText rdf:datatype="http://www.w3.org/2001/XMLSchema#string">logpc</j.0:hasServiceNameText> <j.0:hasOperation rdf:resource="#operation"/> <rdf:type rdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#serviceDescription"/> </rdf:Description> <rdf:Description rdf:about="#input"> <j.0:objectType rdf:resource="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#smilesmolecule"/> <rdf:type rdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#parameter"/> </rdf:Description> <rdf:Description rdf:about="#operation"> <j.0:outputParameter rdf:resource="#output"/> <j.0:inputParameter rdf:resource="#input"/> <rdf:type rdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#operation"/> </rdf:Description> <rdf:Description rdf:about="#output"> <j.0:objectType rdf:resource="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#alogpsmilesmolecule"/> <rdf:type rdf:resource="http://www.mygrid.org.uk/mygrid-moby-service#parameter"/> </rdf:Description> </rdf:RDF>
  • 21. Responds to a POST containing service input with a service output in RDF 21 <rdf:Description rdf:about="http://semanticscience.org/sadi/ontology/caffeine.rdf#mdalogp"> <rdf:type rdf:resource="http://semanticscience.org/resource/CHEMINF_000251"/> <j.0:SIO_000300 rdf:datatype="http://www.w3.org/2001/XMLSchema#double">-0.4311000000000006</j.0:SIO_000300> </rdf:Description> <rdf:RDF xmlns="http://semanticscience.org/sadi/ontology/caffeine.rdf#" xmlns:so="http://semanticscience.org/sadi/ontology/lipinskiserviceontology.owl#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:sio="http://semanticscience.org/resource/" xmlns:xsd="http://www.w3.org/2001/XMLSchema#"> <so:smilesmolecule rdf:about="http://semanticscience.org/sadi/ontology/caffeine.rdf#m"> <sio:SIO_000008 rdf:resource = "http://semanticscience.org/sadi/ontology/caffeine.rdf#msmiles"/> </so:smilesmolecule> <sio:CHEMINF_000018 rdf:about = "http://semanticscience.org/sadi/ontology/caffeine.rdf#msmiles"> <sio:SIO_000300 rdf:datatype="xsd:string">Cn1cnc2n(C)c(=O)n(C)c(=O)c12</sio:SIO_000300> </sio:CHEMINF_000018> </rdf:RDF> curl --data @caffeine.rdf http://cbrass.biordf.net/logpdc/logpc
  • 23. 23 Semantic Health and Research Environment SHARE is an application that execute (SPARQL) queries as workflows over SADI Services
  • 24. “Reckoning” dynamic discovery of instances of OWL classes through synthesis and invocation of a Web Service workflow capable of generating data described by the OWL class restrictions, followed by reasoning to classify the data into that ontology 24
  • 25. ChEBI has (non-SW) data! 25
  • 26. Bio2RDF provides ChEBI in RDF  26
  • 27. Bio2RDF is now serving over 40 billion triples of linked biological data 27
  • 28. Bio2RDF covers the major biological databases 28
  • 29. Bio2RDF is part of a growing web of linked data 29 “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  • 30. something you can lookup or search for with rich descriptions 30
  • 31. 31 SPARQL is the new cool kid on the query block SQL SPARQL
  • 33. 33
  • 34. Query: Is caffeine a drug-like molecule? 34
  • 35. Benefits • Data remains distributed – as the internet was meant to be! • Data is not “exposed” as a SPARQL endpoint – greater provider-control over computational resources • Service invocation is straightforward and matchmaking by reasoning about ontology-based input/output descriptions 35
  • 36. Summary • Semantic Web technologies offer tantalizing new opportunities to publish, share and query data and services • Bio2RDF provides linked life science data • SADI provides a framework to provide semantic web services • SHARE allows us to simultaneously query and reason about data and services represented using RDF/OWL 36 CASCON: Nov 3, 2010
  • 37. 37 Acknowledgements This research is supported by The Heart + Stroke Foundation of BC and Yukon, Microsoft Research, The Canadian Institutes of Health Research, The Natural Sciences and Engineering Research Council of Canada and CANARIE. Marc-Alexandre Nolin & Francois Belleau (Bio2RDF) Leo Chepelev (implementing the services) Luke McCarthy (SADI technical support) Mark Wilkinson (vision and leadership) Chris Baker (lipidomics) CHEMINF Group Leo Chepelev Janna Hastings Egon Willighagen Nico Adams

Editor's Notes

  1. Can’t answer questions that require background knowledge