Bio2RDF: Convert, Provide And
          Reuse.

     Marc-Alexandre Nolin1, Jacques Corbeil1,
      Luc Lamontagne1, Michel Dumontier1,2

              1 Laval University, Canada
              2 Carleton University, Canada

                  manolin@gmail.com
Presentation Plan
        • Bio2RDF
             – Methodology
        •    Data conversion
        •    Data provision
        •    Reuse
        •    Future work



2010-10-13             Bio2RDF: Convert, Provide And Reuse.   2
Bio2RDF
        • Bio2RDF uses open-source Semantic Web
          technologies to provide interlinked life science
          data to support biological knowledge discovery.
             •   Over 40 databases converted
             •   Over 30 billion triples
             •   Global mirroring
             –   Quebec City, Quebec, Canada
             –   Ottawa, Ontario, Canada
             –   Guelph, Ontario, Canada
             –   Brisbane, Australia
        – part of LOD mashup at http://lod.openlinksw.com

2010 Linked Open Data Cloud




             “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”



Bio2RDF is the major contributor
                 to the Life Sciences LOD




             “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”

What is Linked Open Data?
        • Linked open data is data that
             •   is free to use
             •   is machine understandable (uses RDF/OWL)
             •   can be looked up using web protocols
             •   has meaningful relations between data items
                 (generated from supplied cross-references or
                 text-based mappings)




LOD Methodology
        Applying Tim Berners-Lee's four rules
             1. Use URIs as names for things
             2. Use HTTP URIs so that people can look up those
                names
             3. When someone looks up a URI, provide useful
                information, using the standards (RDF*, SPARQL)
             4. Include links to other URIs, so that they can
                discover more things

             –   http://www.w3.org/DesignIssues/LinkedData


Bio2RDF Methodology
        We extend LOD rules by:
             1. Consistently naming all resources
                http://bio2rdf.org/namespace:identifier
             2. Resolving Bio2RDF URIs to a set of statements
                about the requested resource
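
A minimal sketch of the naming rule above, in Python. Only the `http://bio2rdf.org/namespace:identifier` pattern comes from the slide; the helper names `bio2rdf_uri` and `split_bio2rdf_uri` and the validation checks are hypothetical, added for illustration.

```python
from typing import Tuple

BASE = "http://bio2rdf.org/"


def bio2rdf_uri(namespace: str, identifier: str) -> str:
    """Build a URI of the form http://bio2rdf.org/namespace:identifier."""
    # Assumption: a namespace never contains ':' or '/', so the URI stays parseable.
    if not namespace or ":" in namespace or "/" in namespace:
        raise ValueError(f"invalid namespace: {namespace!r}")
    if not identifier:
        raise ValueError("identifier must not be empty")
    return f"{BASE}{namespace}:{identifier}"


def split_bio2rdf_uri(uri: str) -> Tuple[str, str]:
    """Recover (namespace, identifier) from a URI built by bio2rdf_uri."""
    if not uri.startswith(BASE):
        raise ValueError(f"not a Bio2RDF URI: {uri!r}")
    # Split on the first ':' after the base; identifiers may contain more colons.
    namespace, sep, identifier = uri[len(BASE):].partition(":")
    if not sep or not namespace or not identifier:
        raise ValueError(f"malformed Bio2RDF URI: {uri!r}")
    return namespace, identifier


print(bio2rdf_uri("uniprot", "P19367"))
```

Because every resource follows the same pattern, a client can derive the URI for any cross-reference from nothing more than a database name and an accession.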




Data providers have been
                linking data for years
      •      Links are done from
             one HTML page to
             another
      •      This works for human
             consumption, but
             doesn’t scale with huge
             amounts of data




However, most linked open data is
 created and provided without the
 help of the original data provider
Why be part of the linked data
                       cloud?
        •     Enable queries that span more than one
              database.
               –   Example: filtering a PubMed search by
                   microarray expression level
        •     Reduce the size of a database by only
              referencing data instead of including it in
              a database record (e.g. citations)



But something is missing!




             “Derived from the Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”


Convert




        • Take a step back and think about what your data
          represents.
        • Forget about the way it is currently represented in
          your relational database
What is RDFizing?
        • RDFizing converts legacy data into one or
          more RDF statements or triples:
                     <Subject> <Predicate> <Object>
        • Triples map onto the parts of a standard
          database table:
             – Subject → row identifier
             – Predicate → column name
             – Object → value
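
The correspondence above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not Bio2RDF's converter: the function name `rdfize_row`, the `http://example.org/` base, the `vocabulary:` predicate scheme, and the sample row are all made up for the example.

```python
def rdfize_row(base_uri, row_id, row):
    """Turn one table row into (subject, predicate, object) triples."""
    subject = f"{base_uri}{row_id}"  # row identifier -> subject
    triples = []
    for column, value in row.items():
        if value is None:
            continue  # a NULL cell yields no statement
        # Hypothetical predicate scheme: one URI per column name.
        predicate = f"{base_uri}vocabulary:{column}"  # column name -> predicate
        triples.append((subject, predicate, value))   # cell value -> object
    return triples


# Illustrative row only; the column names and values are not taken from a real database.
row = {"name": "hexokinase 1", "organism": "Homo sapiens", "comment": None}
for triple in rdfize_row("http://example.org/", "protein:P19367", row):
    print(triple)
```

A literal translation like this is exactly the "old relational database format … in RDF" that the following slide warns about; it is a starting point, not the goal.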

Convert
        • Converting to RDF just for the sake of providing
          another format will not add much to your offering
          if the result is your old relational database format
          … in RDF
        • Get rid of relational database artifacts while
          RDFizing:
        1. Use simple and stable identifiers to name
           resources
        2. Create types for the entities that your data refers
           to, and specify the nature of the relations that
           hold between them

Provide – RDF Documents
        • URIs can be created with a REST-like look
        • Example of stable URIs
               http://purl.uniprot.org/uniprot/P19367
                  http://bio2rdf.org/uniprot:P19367
        • Documents containing statements should be
          accessible using web technology (HTTP protocol)
        • Provide data dumps
        • But in the LOD world, we want to query online
          databases!
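
As a rough sketch of looking up a URI over HTTP, the Python below prepares a GET request that asks for RDF rather than HTML. The function names are hypothetical, and it is an assumption that a given server honors the `Accept` header with the `application/rdf+xml` media type; the slides only state that documents should be accessible over HTTP.

```python
import urllib.request


def rdf_request(uri, media_type="application/rdf+xml"):
    """Prepare an HTTP GET that asks for RDF instead of HTML."""
    # Assumption: the server performs content negotiation on the Accept header.
    return urllib.request.Request(uri, headers={"Accept": media_type})


def fetch_rdf(uri, timeout=30):
    """Send the request and return the response body as text (needs network access)."""
    with urllib.request.urlopen(rdf_request(uri), timeout=timeout) as response:
        return response.read().decode("utf-8")


# Building the request does not touch the network; only fetch_rdf() does.
request = rdf_request("http://bio2rdf.org/uniprot:P19367")
print(request.get_method(), request.full_url, request.get_header("Accept"))
```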
Provide - SPARQL
        • SPARQL is the query language for RDF/OWL
          that uses web technology (HTTP)
        • SPARQL endpoints make it possible to query
          databases using SPARQL
        • Distributed SPARQL will carve up the query and
          determine which endpoints need to be queried
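
To make "SPARQL over HTTP" concrete, here is a small Python sketch that encodes a query as a GET URL using the `query` parameter defined by the SPARQL protocol. The endpoint address `http://example.org/sparql` is a placeholder, not a real Bio2RDF endpoint, and `sparql_get_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

ENDPOINT = "http://example.org/sparql"  # placeholder endpoint, not from the slides

# List up to ten statements about one resource.
QUERY = """
SELECT ?predicate ?object
WHERE { <http://bio2rdf.org/uniprot:P19367> ?predicate ?object }
LIMIT 10
"""


def sparql_get_url(endpoint, query):
    """Encode a SPARQL query as an HTTP GET URL via the 'query' parameter."""
    return f"{endpoint}?{urlencode({'query': query.strip()})}"


print(sparql_get_url(ENDPOINT, QUERY))
```

Any HTTP client can then fetch the resulting URL, which is what lets a distributed query engine send parts of one query to several endpoints.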




Provide
        • Publish the scheme you will use for your URIs so
          that other providers may use it

        • Provide access to documents with resolvable URIs
          (can be looked up using a web browser)
               http://geneprovider.com/gene:identifier
        • Now other data providers can use this identifier
          instead of copying the data into their own databases!

Reuse
        • An RDF version of your documents without
          resolvable external links is just another file
          format
        • One of the most problematic issues is that
          the RDF generated by some providers is
          only inward-looking: they don’t reuse
          published URIs (Polite URI)


Reuse
        • Example with Uniprot RDF
        • Uniprot is one of the first data providers to offer
          stable and resolvable URIs for its documents.
          However, we can’t directly use the RDF they
          provide. Look at this extract of Human HK1 in RDF:
        <rdf:Description rdf:about="http://purl.uniprot.org/uniprot/P19367">
            <rdf:type rdf:resource="http://purl.uniprot.org/core/Protein" />
            <rdfs:seeAlso rdf:resource="http://purl.uniprot.org/refseq/NP_277035.2"/>
        </rdf:Description>

        • The problem is that http://purl.uniprot.org/refseq/NP_277035.2
          resolves to the NCBI HTML page of NP_277035.2
        • Since NCBI does not provide RDF, it’s a dead end


Reuse
        • Uniprot did the same thing we have done at
          Bio2RDF: they created a URI in their own namespace.
          From that URI, which they control, they chose to
          redirect to the original HTML document of the
          specified ID

        • The difference with Bio2RDF is that we also
          resolve the other URI to an RDF document
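
One way to picture the difference is to rewrite the outgoing Uniprot link into the Bio2RDF naming pattern. The Python sketch below parses the Human HK1 extract shown earlier in the deck and maps each `rdfs:seeAlso` target from `http://purl.uniprot.org/<namespace>/<id>` to `http://bio2rdf.org/<namespace>:<id>`. The mapping rule and function names are assumptions made for illustration; the slides do not describe how Bio2RDF performs this step or confirm that the resulting URI resolves. The `rdf:RDF` wrapper with namespace declarations is added only so the extract parses on its own.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"
PURL = "http://purl.uniprot.org/"

# The extract from the slides, wrapped in rdf:RDF so it is well-formed by itself.
EXTRACT = f"""
<rdf:RDF xmlns:rdf="{RDF}" xmlns:rdfs="{RDFS}">
  <rdf:Description rdf:about="http://purl.uniprot.org/uniprot/P19367">
    <rdf:type rdf:resource="http://purl.uniprot.org/core/Protein" />
    <rdfs:seeAlso rdf:resource="http://purl.uniprot.org/refseq/NP_277035.2"/>
  </rdf:Description>
</rdf:RDF>
"""


def to_bio2rdf(uri):
    """Map http://purl.uniprot.org/<namespace>/<id> to http://bio2rdf.org/<namespace>:<id>."""
    if not uri.startswith(PURL):
        return uri  # leave URIs from other providers untouched
    namespace, sep, identifier = uri[len(PURL):].partition("/")
    if not sep or not identifier:
        return uri  # nothing to rewrite without both a namespace and an identifier
    return f"http://bio2rdf.org/{namespace}:{identifier}"


def see_also_links(rdf_xml):
    """Return (subject, rewritten seeAlso target) pairs found in an RDF/XML string."""
    root = ET.fromstring(rdf_xml)
    links = []
    for description in root.iter(f"{{{RDF}}}Description"):
        subject = description.get(f"{{{RDF}}}about")
        for see_also in description.findall(f"{{{RDFS}}}seeAlso"):
            target = see_also.get(f"{{{RDF}}}resource")
            links.append((subject, to_bio2rdf(target)))
    return links


print(see_also_links(EXTRACT))
```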


Reuse
        • Relational database artifacts: copying data
          from one provider to another
        • A Uniprot citation entry inside a protein document:
        <rdf:Description rdf:about="http://purl.uniprot.org/citations/10686099">
            <rdf:type rdf:resource="http://purl.uniprot.org/core/Journal_Citation" />
            <title>Crystal structures of mutant monomeric hexokinase I reveal multiple
                ...</title>
            <author>Aleshin A.E.</author>
            <author>Kirby C.</author>
            <skos:exactMatch rdf:resource="http://purl.uniprot.org/medline/20223513" />
            <skos:exactMatch rdf:resource="http://purl.uniprot.org/pubmed/10686099" />
        </rdf:Description>

        • In a Linked Data world, you only need a
          link to the PubMed URI at NCBI. The up-to-date
          information is there.


With all this, what does the LOD for life
               sciences by original data providers look like now?




What is still missing!




             “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”




Some handshake




The new version of LOD for
             life sciences by original providers




Something is missing!




             “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”




Bottom line
        • That is not quite what we had at the beginning
        • The original network made by third-party providers still
          exists. Use it as if it were another provider.
        • What about documents without an RDF version?
             – NAR lists 1230 databases: too many for Bio2RDF or
               anyone else to convert entirely.
             – Use third-party providers, like Bio2RDF, NeuroCommons,
               LinkedLifeData, Shared Names, etc.
             – Add these in addition to the original data URIs
             – Some databases, for various reasons, might never create an RDF
               version of their data, so third parties will remain useful
               by providing that data in RDF

By applying those conversion, publication and
         reuse rules to your data, we will witness the
         birth of a more stable network of linked data
          … and the death (in the very long term) of
                           Bio2RDF.

                Let's kill Bio2RDF together!!



Future Work
        • Data processing workflow
        • New facet-based user interface to browse
          and formulate sophisticated queries
        • Full text indexing for autocompletion
          support
        • Exploring knowledge discovery
          possibilities in the linked data network

Thanks
        • Bio2RDF community
             – Centre de recherche du CHUL
             – Dumontier Lab members
             – QUT eResearch Center
        • Triplestore provided by OpenLink Virtuoso
        • François Belleau



Acknowledgment
        • Marc-Alexandre Nolin's funding is provided by
          CANARIE via the C-BRASS project
        • Servers in Quebec City are provided by
          Jacques Corbeil of Laval University




Contact Information
        •    Mailing list : bio2rdf@googlegroups.com
        •    URL : http://bio2rdf.org
        •    Wiki : http://sourceforge.net/apps/mediawiki/bio2rdf
        •    Blog : http://bio2rdf.blogspot.com





More Related Content

What's hot

MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 20133 Round Stones
 
The bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersThe bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersHerbert Van de Sompel
 
Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...James Powell
 
Lecture linked data cloud & sparql
Lecture linked data cloud & sparqlLecture linked data cloud & sparql
Lecture linked data cloud & sparqlDhavalkumar Thakker
 
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...Karen R
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on AndroidEUCLID project
 
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?eby
 
RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for MetadataRDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for MetadataASIS&T
 
2012 02 aos-johanneskeizer
2012 02 aos-johanneskeizer2012 02 aos-johanneskeizer
2012 02 aos-johanneskeizerJohannes Keizer
 
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)Dag Endresen
 
Building Collections in IRs from External Data Sources
Building Collections in IRs from External Data SourcesBuilding Collections in IRs from External Data Sources
Building Collections in IRs from External Data SourcesSusan Matveyeva
 
The Semantic Web #5 - RDF (2)
The Semantic Web #5 - RDF (2)The Semantic Web #5 - RDF (2)
The Semantic Web #5 - RDF (2)Myungjin Lee
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordJisc
 
Type inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia linksType inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia linksAndrea Nuzzolese
 
The Semantic Web #4 - RDF (1)
The Semantic Web #4 - RDF (1)The Semantic Web #4 - RDF (1)
The Semantic Web #4 - RDF (1)Myungjin Lee
 
Database Integration toward Semantic Web: Development of Ontologies and RDF ...
Database Integration toward Semantic Web: Development of  Ontologies and RDF ...Database Integration toward Semantic Web: Development of  Ontologies and RDF ...
Database Integration toward Semantic Web: Development of Ontologies and RDF ...Database Center for Life Science
 

What's hot (19)

MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
MIT CSAIL Linked Data Ventures Class: Linked Open Data for Entrepreneurs 2013
 
The bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking ServersThe bX project: Federating and Mining Usage Logs from Linking Servers
The bX project: Federating and Mining Usage Logs from Linking Servers
 
Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...
 
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
NISO Forum, Denver, Sept. 24, 2012: Opening Keynote: The Many and the One: BC...
 
Lecture linked data cloud & sparql
Lecture linked data cloud & sparqlLecture linked data cloud & sparql
Lecture linked data cloud & sparql
 
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
Dublin Core Registry to Support Multilinguality : Te Reo Māori Dublin Core Me...
 
Gap Analysis
Gap AnalysisGap Analysis
Gap Analysis
 
Querying Linked Data on Android
Querying Linked Data on AndroidQuerying Linked Data on Android
Querying Linked Data on Android
 
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
Karen Coyle Keynote - R&D: Can Resource Description become Rigorous Data?
 
RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for MetadataRDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
 
2012 02 aos-johanneskeizer
2012 02 aos-johanneskeizer2012 02 aos-johanneskeizer
2012 02 aos-johanneskeizer
 
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
EURISCO and GBIF, at the European genbank network meeting (Bonn, April 2004)
 
Building Collections in IRs from External Data Sources
Building Collections in IRs from External Data SourcesBuilding Collections in IRs from External Data Sources
Building Collections in IRs from External Data Sources
 
The Semantic Web #5 - RDF (2)
The Semantic Web #5 - RDF (2)The Semantic Web #5 - RDF (2)
The Semantic Web #5 - RDF (2)
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published record
 
Type inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia linksType inference through the analysis of Wikipedia links
Type inference through the analysis of Wikipedia links
 
Why open drug discovery needs four simple rules for licensing data and models
Why open drug discovery needs four simple rules for licensing data and modelsWhy open drug discovery needs four simple rules for licensing data and models
Why open drug discovery needs four simple rules for licensing data and models
 
The Semantic Web #4 - RDF (1)
The Semantic Web #4 - RDF (1)The Semantic Web #4 - RDF (1)
The Semantic Web #4 - RDF (1)
 
Database Integration toward Semantic Web: Development of Ontologies and RDF ...
Database Integration toward Semantic Web: Development of  Ontologies and RDF ...Database Integration toward Semantic Web: Development of  Ontologies and RDF ...
Database Integration toward Semantic Web: Development of Ontologies and RDF ...
 

Viewers also liked

AcornRedecom2010PhotoArchiveSession1Room2
AcornRedecom2010PhotoArchiveSession1Room2AcornRedecom2010PhotoArchiveSession1Room2
AcornRedecom2010PhotoArchiveSession1Room2MarcioAranha
 
AcornRedecom2010PhotoArchiveSession5Room2
AcornRedecom2010PhotoArchiveSession5Room2AcornRedecom2010PhotoArchiveSession5Room2
AcornRedecom2010PhotoArchiveSession5Room2MarcioAranha
 
Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...nolmar01
 
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario RipperAcorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario RipperMarcioAranha
 
Bio2 Rdf Presentation V3
Bio2 Rdf Presentation V3Bio2 Rdf Presentation V3
Bio2 Rdf Presentation V3nolmar01
 
SOC Business Opportunity Flipchart
SOC Business Opportunity FlipchartSOC Business Opportunity Flipchart
SOC Business Opportunity FlipchartJulieCollins
 
AcornRedecom2010PhotoArchiveSession2Auditorium
AcornRedecom2010PhotoArchiveSession2AuditoriumAcornRedecom2010PhotoArchiveSession2Auditorium
AcornRedecom2010PhotoArchiveSession2AuditoriumMarcioAranha
 
AcornRedecom2010PhotoArchiveSession6Room1
AcornRedecom2010PhotoArchiveSession6Room1AcornRedecom2010PhotoArchiveSession6Room1
AcornRedecom2010PhotoArchiveSession6Room1MarcioAranha
 
AcornRedecom2010PhotoArchiveCoffeeBreak1
AcornRedecom2010PhotoArchiveCoffeeBreak1AcornRedecom2010PhotoArchiveCoffeeBreak1
AcornRedecom2010PhotoArchiveCoffeeBreak1MarcioAranha
 
AcornRedecom2010PhotoArchiveSession5Room1
AcornRedecom2010PhotoArchiveSession5Room1AcornRedecom2010PhotoArchiveSession5Room1
AcornRedecom2010PhotoArchiveSession5Room1MarcioAranha
 
Insurance Lead Services Presentation
Insurance Lead Services PresentationInsurance Lead Services Presentation
Insurance Lead Services PresentationDEG Consulting
 
Fate of Nanoparticles
Fate of NanoparticlesFate of Nanoparticles
Fate of Nanoparticlesslacerda
 
Animal Reproduction
Animal ReproductionAnimal Reproduction
Animal Reproductionazuerseccion
 

Viewers also liked (15)

AcornRedecom2010PhotoArchiveSession1Room2
AcornRedecom2010PhotoArchiveSession1Room2AcornRedecom2010PhotoArchiveSession1Room2
AcornRedecom2010PhotoArchiveSession1Room2
 
AcornRedecom2010PhotoArchiveSession5Room2
AcornRedecom2010PhotoArchiveSession5Room2AcornRedecom2010PhotoArchiveSession5Room2
AcornRedecom2010PhotoArchiveSession5Room2
 
Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...Providing named entity based search with a common biological database naming ...
Providing named entity based search with a common biological database naming ...
 
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario RipperAcorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
Acorn-Redecom 2010 Photo Archive Keynote Speaker Mario Ripper
 
Bio2 Rdf Presentation V3
Bio2 Rdf Presentation V3Bio2 Rdf Presentation V3
Bio2 Rdf Presentation V3
 
SOC Business Opportunity Flipchart
SOC Business Opportunity FlipchartSOC Business Opportunity Flipchart
SOC Business Opportunity Flipchart
 
AcornRedecom2010PhotoArchiveSession2Auditorium
AcornRedecom2010PhotoArchiveSession2AuditoriumAcornRedecom2010PhotoArchiveSession2Auditorium
AcornRedecom2010PhotoArchiveSession2Auditorium
 
AcornRedecom2010PhotoArchiveSession6Room1
AcornRedecom2010PhotoArchiveSession6Room1AcornRedecom2010PhotoArchiveSession6Room1
AcornRedecom2010PhotoArchiveSession6Room1
 
AcornRedecom2010PhotoArchiveCoffeeBreak1
AcornRedecom2010PhotoArchiveCoffeeBreak1AcornRedecom2010PhotoArchiveCoffeeBreak1
AcornRedecom2010PhotoArchiveCoffeeBreak1
 
AcornRedecom2010PhotoArchiveSession5Room1
AcornRedecom2010PhotoArchiveSession5Room1AcornRedecom2010PhotoArchiveSession5Room1
AcornRedecom2010PhotoArchiveSession5Room1
 
Springpractice
SpringpracticeSpringpractice
Springpractice
 
Insurance Lead Services Presentation
Insurance Lead Services PresentationInsurance Lead Services Presentation
Insurance Lead Services Presentation
 
Jeopardy Usa
Jeopardy UsaJeopardy Usa
Jeopardy Usa
 
Fate of Nanoparticles
Fate of NanoparticlesFate of Nanoparticles
Fate of Nanoparticles
 
Animal Reproduction
Animal ReproductionAnimal Reproduction
Animal Reproduction
 

Similar to W4 4 marc-alexandre-nolin-v2

Charper.lawdi.20120601
Charper.lawdi.20120601Charper.lawdi.20120601
Charper.lawdi.20120601charper
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationseswcsummerschool
 
Wed batsakis tut_chalasdlenges of preservations
Wed batsakis tut_chalasdlenges of preservationsWed batsakis tut_chalasdlenges of preservations
Wed batsakis tut_chalasdlenges of preservationseswcsummerschool
 
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...
‘Facilitating User Engagement by Enriching Library Data using Semantic Techno...CONUL Conference
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?Ivan Herman
 
ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2Martin Hepp
 
GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2guestecacad2
 
Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...Creating a sustainable business model for a digital repository: the Dryad exp...
Creating a sustainable business model for a digital repository: the Dryad exp...ASIS&T
 
OCLC Linked Data Roundtable event IFLA 2012
OCLC Linked Data Roundtable event IFLA 2012OCLC Linked Data Roundtable event IFLA 2012
OCLC Linked Data Roundtable event IFLA 2012nw13
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedSören Auer
 
Vila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxVila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxLIS EPI Meeting
 
Usage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosUsage of Linked Data: Introduction and Application Scenarios
Usage of Linked Data: Introduction and Application ScenariosEUCLID project
 

Similar to W4 4 marc-alexandre-nolin-v2 (20)

Charper.lawdi.20120601
Charper.lawdi.20120601Charper.lawdi.20120601
Charper.lawdi.20120601
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
W4 4 marc-alexandre-nolin-v2

  • 1. Bio2RDF: Convert, Provide And Reuse. Marc-Alexandre Nolin1, Jacques Corbeil1, Luc Lamontagne1, Michel Dumontier1,2 1 Laval University, Canada 2 Carleton University, Canada manolin@gmail.com
  • 2. Presentation Plan • Bio2RDF – Methodology • Data conversion • Data provision • Reuse • Future work 2010-10-13 Bio2RDF: Convert, Provide And Reuse. 2
  • 3. Bio2RDF • Bio2RDF uses open-source Semantic Web technologies to provide interlinked life science data to support biological knowledge discovery. • Over 40 databases converted • Over 30 billion triples • Global mirroring – Quebec City, Quebec, Canada – Ottawa, Ontario, Canada – Guelph, Ontario, Canada – Brisbane, Australia – part of LOD mashup at http://lod.openlinksw.com
  • 4. 2010 Linked Open Data Cloud “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  • 5. Bio2RDF is the major contributor to the Life Sciences LOD “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  • 6. What is Linked Open Data? • Linked open data is • data that is free to use • machine understandable (uses RDF/OWL) • can be looked up using web protocols • has meaningful relations between data items (generated from supplied cross-references, or text-based mappings)
  • 7. LOD Methodology Applying Tim Berners-Lee's 4 rules 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs, so that they can discover more things – http://www.w3.org/DesignIssues/LinkedData
  • 8. Bio2RDF Methodology We extend LOD rules by: 1. Consistently naming all resources http://bio2rdf.org/namespace:identifier 2. Resolving Bio2RDF URIs to a set of statements about the requested resource
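The naming rule on slide 8 can be illustrated with a small helper. This is only a sketch: the function name `bio2rdf_uri`, the lowercasing of the namespace, and the percent-encoding of unusual characters are assumptions made for the example, not part of the published Bio2RDF software.

```python
from urllib.parse import quote

# Base taken from the URI pattern shown on the slide.
BIO2RDF_BASE = "http://bio2rdf.org/"


def bio2rdf_uri(namespace: str, identifier: str) -> str:
    """Build a URI of the form http://bio2rdf.org/namespace:identifier."""
    # Assumption: namespaces are normalized to lowercase for consistency.
    ns = namespace.strip().lower()
    ident = identifier.strip()
    if not ns or not ident:
        raise ValueError("namespace and identifier must both be non-empty")
    # Assumption: characters outside the usual identifier set are percent-encoded.
    return f"{BIO2RDF_BASE}{ns}:{quote(ident, safe='._-')}"


if __name__ == "__main__":
    print(bio2rdf_uri("uniprot", "P19367"))
```

With the UniProt accession used later in the talk, the helper produces the same URI that appears on slide 18.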
  • 9. Data providers have been linking data for years • Links are done from one HTML page to another • This works for human consumption, but doesn’t scale with huge amounts of data
  • 10. However, most linked open data is created and provided without the help of the original data provider
  • 11. Why be part of the linked data cloud? • Enable queries that span more than one database. – Example: filtering a PubMed search by a microarray expression level • Reduce the size of a database by only referencing data instead of including it in a database record (e.g. citations)
  • 12. But something is missing! “Derived from Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  • 13. Presentation Plan • Bio2RDF • Data conversion • Data provision • Reuse • Future work
  • 14. Convert • Take a step back and think about what your data represents. • Forget about the way it is currently represented in your relational database
  • 15. What is RDFizing? • RDFizing converts legacy data into one or more RDF statements or triples: <Subject> <Predicate> <Object> • The parts of a triple correspond to the parts of a standard database record – Subject → row identifier – Predicate → column name – Object → value
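The correspondence on slide 15 (row identifier, column name, value) can be sketched as a literal row-to-triples conversion. Everything named here is hypothetical: the `http://example.org/` base, the `table_vocabulary:column` predicate pattern, and the sample row are illustrations, not the Bio2RDF conversion scripts.

```python
def escape_literal(value: str) -> str:
    """Escape the characters that are special inside an N-Triples string literal."""
    return (value.replace("\\", "\\\\")
                 .replace('"', '\\"')
                 .replace("\n", "\\n"))


def row_to_triples(base: str, table: str, key_column: str, row: dict) -> list:
    """Turn one relational row into N-Triples lines.

    Subject   <- row identifier (the value of key_column)
    Predicate <- column name
    Object    <- cell value, emitted as a plain literal
    """
    subject = f"<{base}{table}:{row[key_column]}>"
    triples = []
    for column, value in row.items():
        if column == key_column or value is None:
            continue  # the key becomes the subject; NULL cells yield no triple
        predicate = f"<{base}{table}_vocabulary:{column}>"
        triples.append(f'{subject} {predicate} "{escape_literal(str(value))}" .')
    return triples


if __name__ == "__main__":
    # Hypothetical row from a hypothetical "protein" table.
    row = {"id": "P19367", "name": "Hexokinase-1", "organism": "Homo sapiens"}
    for line in row_to_triples("http://example.org/", "protein", "id", row):
        print(line)
```

A mapping this literal reproduces the relational layout in RDF, so it is best read as a starting point before typing the entities and replacing literals with links to other resources.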
  • 16. Convert • Converting to RDF just for the sake of providing another format will not add much to your offering if the result is your old relational database format … in RDF • Get rid of relational database artifacts while RDFizing 1. Use simple and stable identifiers to name resources 2. Create types for the entities that your data refers to and specify the nature of the relations that hold between them
  • 17. Presentation Plan • Bio2RDF • Data conversion • Data provision • Reuse • Future work
  • 18. Provide – RDF Documents • URIs can be created with a REST-like look • Example of stable URIs http://purl.uniprot.org/uniprot/P19367 http://bio2rdf.org/uniprot:P19367 • Documents containing statements should be accessible using web technology (HTTP protocol) • Provide data dumps • But in the LOD world, we want to query online databases!
  • 19. Provide - SPARQL • SPARQL is the query language for RDF/OWL that uses web technology (HTTP) • SPARQL endpoints make it possible to query databases using SPARQL • Distributed SPARQL will carve up the query and determine which endpoints need to be queried
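As a rough illustration of querying over HTTP, the sketch below builds (but does not send) a SPARQL Protocol GET request using only the Python standard library. The endpoint address `http://example.org/sparql` is a placeholder, not a real Bio2RDF endpoint, and the query is only an example.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Placeholder endpoint; substitute the address of a real SPARQL endpoint.
ENDPOINT = "http://example.org/sparql"

# Example query: list up to 10 statements about one resource.
QUERY = """
SELECT ?p ?o
WHERE { <http://bio2rdf.org/uniprot:P19367> ?p ?o }
LIMIT 10
"""


def build_sparql_request(endpoint: str, query: str) -> Request:
    """Build a SPARQL Protocol GET request carrying the query in the 'query' parameter."""
    url = f"{endpoint}?{urlencode({'query': query})}"
    # Ask for the standard JSON serialization of SELECT results.
    return Request(url, headers={"Accept": "application/sparql-results+json"})


if __name__ == "__main__":
    request = build_sparql_request(ENDPOINT, QUERY)
    print(request.get_method(), request.full_url)
    # To actually run the query against a live endpoint:
    #   from urllib.request import urlopen
    #   with urlopen(request) as response:
    #       print(response.read().decode("utf-8"))
```

Keeping request construction separate from sending makes the example safe to run offline; the commented lines show where the network call would go.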
  • 20. Provide • Publish the scheme you will use for your URIs so that other providers may use it • Provide access to documents with resolvable URIs (can be looked up using a web browser) http://geneprovider.com/gene:identifier • Now other data providers can use this identifier instead of copying the data into their own!
  • 21. Presentation Plan • Bio2RDF • Data conversion • Data provision • Reuse • Future work
  • 22. Reuse • An RDF version of your documents without resolvable external links is just another file format • One of the most problematic issues is that the RDF generated by some providers is only inward-looking: they don’t reuse published URIs (Polite URI)
  • 23. Reuse • Example with Uniprot RDF • Uniprot is one of the first data providers to offer stable and resolvable URIs for its documents. However, we can’t directly use the RDF they provide. Look at this extract of Human HK1 in RDF <rdf:Description rdf:about="http://purl.uniprot.org/uniprot/P19367"> <rdf:type rdf:resource="http://purl.uniprot.org/core/Protein" /> <rdfs:seeAlso rdf:resource="http://purl.uniprot.org/refseq/NP_277035.2"/> </rdf:Description> • The problem is that http://purl.uniprot.org/refseq/NP_277035.2 resolves to the NCBI HTML page of NP_277035.2 • Since NCBI does not provide RDF, it’s a dead end
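To make the dead-end link concrete, the sketch below parses the extract from slide 23 and rewrites the `rdfs:seeAlso` target into the `http://bio2rdf.org/namespace:identifier` form. The wrapping `rdf:RDF` element with its namespace declarations is added so the fragment parses on its own, and the rewrite rule in `to_bio2rdf` is an assumption inferred from the two example URIs on slide 18; the code does not check whether either URI actually resolves.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"

# The slide's extract, wrapped in rdf:RDF so that it is a complete XML document.
EXTRACT = f"""<rdf:RDF xmlns:rdf="{RDF}" xmlns:rdfs="{RDFS}">
  <rdf:Description rdf:about="http://purl.uniprot.org/uniprot/P19367">
    <rdf:type rdf:resource="http://purl.uniprot.org/core/Protein" />
    <rdfs:seeAlso rdf:resource="http://purl.uniprot.org/refseq/NP_277035.2"/>
  </rdf:Description>
</rdf:RDF>"""


def see_also_links(rdf_xml: str) -> list:
    """Return the rdf:resource target of every rdfs:seeAlso element."""
    root = ET.fromstring(rdf_xml)
    return [el.get(f"{{{RDF}}}resource") for el in root.iter(f"{{{RDFS}}}seeAlso")]


def to_bio2rdf(purl_uri: str) -> str:
    """Rewrite http://purl.uniprot.org/<namespace>/<id> as http://bio2rdf.org/<namespace>:<id>.

    Assumed rule, inferred from the paired example URIs in the talk.
    """
    parts = urlparse(purl_uri).path.strip("/").split("/", 1)
    if len(parts) != 2:
        raise ValueError(f"expected /namespace/identifier in {purl_uri!r}")
    namespace, identifier = parts
    return f"http://bio2rdf.org/{namespace}:{identifier}"


if __name__ == "__main__":
    for link in see_also_links(EXTRACT):
        print(link, "->", to_bio2rdf(link))
```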
  • 24. Reuse • Uniprot did the same thing we did at Bio2RDF. They created a URI in their namespace. From that URI, which they control, they decided to redirect to the original HTML document of the specified ID • The difference with Bio2RDF is that we also resolve the other URI to an RDF document
  • 25. Reuse • Relational database artifacts: copying of data from one provider to another • Uniprot Citation entry IN a protein document <rdf:Description rdf:about="http://purl.uniprot.org/citations/10686099"> <rdf:type rdf:resource="http://purl.uniprot.org/core/Journal_Citation" /> <title>Crystal structures of mutant monomeric hexokinase I reveal multiple ...</title> <author>Aleshin A.E.</author> <author>Kirby C.</author> <skos:exactMatch rdf:resource="http://purl.uniprot.org/medline/20223513" /> <skos:exactMatch rdf:resource="http://purl.uniprot.org/pubmed/10686099" /> </rdf:Description> • In a Linked Data world, you only need to have a link to the PubMed URI at NCBI. The up-to-date information is there.
  • 26. With all this, what does the LOD for life sciences from original data providers look like now?
  • 27. What is still missing! “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  • 28. Some handshake
  • 29. The new version of LOD for life sciences by original providers
  • 30. Something is missing! “Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”
  • 31. Bottom line • That is not quite what we had at the beginning • The original network made by third-party providers still exists. Use it as if it were another provider. • What about documents without an RDF version? – According to NAR, there are 1230 databases, too many for Bio2RDF or anyone else to convert entirely. – Use third-party providers, like Bio2RDF, NeuroCommons, LinkedLifeData, Shared Names, etc. – Add these in addition to original data URIs – Some databases, for various reasons, might never create an RDF version of their data, so third parties will remain useful by providing that data in RDF
  • 32. By applying those conversion, publication and reuse rules to your data, we will witness the birth of a more stable network of linked data … and the death (in the very long term) of Bio2RDF. Let's kill Bio2RDF together !!
  • 33. Presentation Plan • Bio2RDF • Data conversion • Data provision • Reuse • Future work
  • 34. Future Work • Data processing workflow • New facet-based user interface to browse and formulate sophisticated queries • Full text indexing for autocompletion support • Exploring knowledge discovery possibilities in the linked data network
  • 35. Thanks • Bio2RDF community – Centre de recherche du CHUL – Dumontier Lab members – QUT eResearch Center • Triplestore provided by Openlink Virtuoso • François Belleau
  • 36. Acknowledgment • Marc-Alexandre Nolin funding provided by CANARIE via the C-BRASS project • Servers in Quebec City are provided by Jacques Corbeil of Laval University
  • 37. Contact Information • Mailing list : bio2rdf@googlegroups.com • URL : http://bio2rdf.org • Wiki : http://sourceforge.net/apps/mediawiki/bio2rdf • Blog : http://bio2rdf.blogspot.com