Collaborative development of cross-database Bio2RDF queries


                Peter Ansell
Microsoft Queensland University of Technology
              eResearch Centre

             p.ansell@qut.edu.au
Introduction
●   Large number of cross-disciplinary datasources in
    different locations
●   Scientists require simplified access to many of the
    datasources for complex research
●   Recently a large number of datasources have been
    published using the RDF syntax
●   Now, we need to learn how to query across them and
    be able to share that knowledge with others

     Sydney, Australia   3rd eResearch Australasia Conference   9-13 Nov 2009

Linked Data
1) Use URIs as names for things
2) Use HTTP URIs so that people can look up those
names.
3) When someone looks up a URI, provide useful
information, using the standards (RDF, SPARQL)
4) Include links to other URIs, so that they can
discover more things.
http://www.w3.org/DesignIssues/LinkedData.html
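
Rules 2 and 3 can be illustrated with a minimal sketch of a lookup request. It only builds the HTTP request and does not send it. The Accept header values and the assumption that the server honours content negotiation are illustrative, not taken from the slides.

```python
from urllib.request import Request


def rdf_request(uri: str) -> Request:
    """Build (but do not send) an HTTP GET request that asks for RDF."""
    # Assumption: the server supports content negotiation and can
    # return RDF/XML or Turtle instead of an HTML page.
    accept = "application/rdf+xml, text/turtle;q=0.9, */*;q=0.1"
    return Request(uri, headers={"Accept": accept})


# The URI is the one used on the Workflow slide.
req = rdf_request("http://bio2rdf.org/label/go:0000345")
print(req.get_method(), req.full_url)
print(req.get_header("Accept"))
```

Passing `req` to `urllib.request.urlopen` would perform the actual lookup.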

Linked Data querying
●   Strategy 1 (Naive) :
           –   Retrieve resources
           –   Retrieve linked resources
           –   Cache the resources and perform queries locally
●   Strategy 2 (Search engine):
           –   Retrieve resources
           –   Retrieve directly linked resources
           –   Query a semantic search engine for related resources
           –   Cache the resources and perform queries locally
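
The steps shared by both strategies (retrieve, follow links, cache, query locally) might look like the sketch below. The `fetch` callable, the `FAKE_WEB` data and the `example.org` URIs are hypothetical stand-ins for real HTTP lookups, and the search engine step of Strategy 2 is left out.

```python
from collections import deque
from typing import Callable, Dict, Iterable, Set, Tuple

Triple = Tuple[str, str, str]


def crawl(start: str,
          fetch: Callable[[str], Iterable[Triple]],
          max_depth: int = 1) -> Dict[str, Set[Triple]]:
    """Retrieve a resource and its linked resources into a local cache."""
    cache: Dict[str, Set[Triple]] = {}
    queue = deque([(start, 0)])
    while queue:
        uri, depth = queue.popleft()
        if uri in cache:
            continue  # already retrieved
        triples = set(fetch(uri))
        cache[uri] = triples
        if depth < max_depth:
            # Follow objects that look like HTTP URIs.
            for _, _, obj in triples:
                if obj.startswith("http://"):
                    queue.append((obj, depth + 1))
    return cache


# Hypothetical in-memory "web" used instead of real HTTP requests.
FAKE_WEB = {
    "http://example.org/a": [
        ("http://example.org/a", "http://example.org/linksTo", "http://example.org/b"),
        ("http://example.org/a", "http://example.org/label", "A"),
    ],
    "http://example.org/b": [
        ("http://example.org/b", "http://example.org/linksTo", "http://example.org/c"),
    ],
    "http://example.org/c": [],
}

cache = crawl("http://example.org/a", lambda u: FAKE_WEB.get(u, []), max_depth=1)

# "Perform queries locally": here, find every label triple in the cache.
labels = [t for triples in cache.values() for t in triples
          if t[1] == "http://example.org/label"]
print(sorted(cache), labels)
```

The cache grows with every link followed, which is the cost that Strategy 3 tries to avoid.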
Linked Data querying
●   Strategy 3 (Distributed query) :
           –   Mix SPARQL endpoint queries with URI based
                resolution to avoid having a large local cache
           –   Normalise results from each site to form final query
                result
           –   Users can process the results, and perform one or more
                queries based on their interpretation of the results
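
As a rough illustration of the normalisation step, the sketch below rewrites site-specific URIs to one common form before results are combined. The prefix `http://site-a.example/` and the assumption that normalisation is a prefix rewrite are hypothetical; the slides do not say how Bio2RDF normalises results.

```python
from typing import Iterable, List, Tuple

Triple = Tuple[str, str, str]


def normalise(triples: Iterable[Triple],
              site_prefix: str,
              common_prefix: str = "http://bio2rdf.org/") -> List[Triple]:
    """Rewrite terms that start with a site's prefix to the common prefix."""
    def fix(term: str) -> str:
        if term.startswith(site_prefix):
            return common_prefix + term[len(site_prefix):]
        return term  # literals and foreign URIs are left untouched

    return [(fix(s), fix(p), fix(o)) for s, p, o in triples]


# Hypothetical result from one site.
site_a = [("http://site-a.example/go:0000345",
           "http://www.w3.org/2000/01/rdf-schema#label",
           "example label")]

normalised = normalise(site_a, "http://site-a.example/")
print(normalised[0][0])
```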




Bio2RDF distributed queries
●   Assign Namespaces to providers
●   Query across relevant providers given a user's query
●   Aggregate all results into a single RDF document
    and return to the user
●   It works: 700,000 queries during the last month
●   The largest dataset, the Protein Data Bank, has 10
    billion triples; the others together make up about
    5 billion triples
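
The first three bullets can be sketched as a namespace-to-provider table plus a merge step. The endpoint URLs and the table contents are hypothetical; only the idea of assigning namespaces to providers and aggregating their results comes from the slide.

```python
from typing import Dict, Iterable, List, Set, Tuple

Triple = Tuple[str, str, str]

# Hypothetical assignment of namespaces to providers.
PROVIDERS: Dict[str, List[str]] = {
    "go": ["http://endpoint-a.example/sparql", "http://endpoint-b.example/sparql"],
    "pdb": ["http://endpoint-c.example/sparql"],
}


def providers_for(namespace: str) -> List[str]:
    """Return the providers relevant to a namespace (empty if unknown)."""
    return PROVIDERS.get(namespace, [])


def aggregate(result_sets: Iterable[Iterable[Triple]]) -> Set[Triple]:
    """Merge the results of all providers into one set of triples."""
    merged: Set[Triple] = set()
    for triples in result_sets:
        merged.update(triples)  # duplicate triples collapse
    return merged


t1 = ("http://bio2rdf.org/go:0000345", "http://example.org/p", "x")
t2 = ("http://bio2rdf.org/go:0000345", "http://example.org/p", "y")
merged = aggregate([[t1, t2], [t2]])
print(providers_for("go"), len(merged))
```

A set is used so that the same triple returned by two providers appears once in the final document.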

Workflow
Resolved URI: http://bio2rdf.org/label/go:0000345

    Host name: http://bio2rdf.org/
    Query: label/go:0000345

Regular expression: label/([\w-]+):(.+)

Queries selected:
    http://bio2rdf.org/query:labelsearch
    http://bio2rdf.org/query:labelsearchforgo
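
The matching step of this workflow can be tried out with the sketch below. It reads the character class in the regular expression as `[\w-]`, which is an assumption, and the function name `parse_label_query` is hypothetical.

```python
import re
from typing import Optional, Tuple

# Pattern from the slide, with the character class read as [\w-] (assumption).
LABEL_PATTERN = re.compile(r"label/([\w-]+):(.+)")


def parse_label_query(resolved_uri: str,
                      host: str = "http://bio2rdf.org/") -> Optional[Tuple[str, str]]:
    """Split a resolved URI into host and query, then match the query."""
    if not resolved_uri.startswith(host):
        return None
    query = resolved_uri[len(host):]      # e.g. "label/go:0000345"
    match = LABEL_PATTERN.fullmatch(query)
    if match is None:
        return None
    namespace, identifier = match.groups()
    return namespace, identifier


parsed = parse_label_query("http://bio2rdf.org/label/go:0000345")
print(parsed)  # ('go', '0000345')
```

The captured namespace is what lets a namespace-specific query such as `query:labelsearchforgo` be chosen alongside the general `query:labelsearch`.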




Collaboration
●   Query and provider definitions have an RDF
    representation
●   Any other person is able to take a definition and
    change it to suit their needs and redistribute their
    definition
●   If definitions are Linked Data themselves, as the
    Bio2RDF configuration items are, the HTTP URI can
    be used by others to pull the definition into their
    own software
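
Taking a definition and changing it might look like the following sketch. The `QueryDefinition` class, its fields and the `example.org` URI are hypothetical; the real definitions are RDF documents whose vocabulary is not shown in these slides.

```python
from dataclasses import dataclass, replace
from typing import Tuple


@dataclass(frozen=True)
class QueryDefinition:
    """Hypothetical in-memory view of a shared query definition."""
    uri: str
    pattern: str
    namespaces: Tuple[str, ...]


# A definition as published by someone else (field values are illustrative).
shared = QueryDefinition(
    uri="http://bio2rdf.org/query:labelsearch",
    pattern=r"label/([\w-]+):(.+)",
    namespaces=("go",),
)

# A customised copy under a new URI; the shared definition is unchanged.
mine = replace(
    shared,
    uri="http://example.org/query:mylabelsearch",
    namespaces=("go", "pdb"),
)
print(mine)
```

Giving the modified copy its own URI keeps the original definition intact for everyone who still refers to it.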
Provenance for queries and data
●   Provenance can be attached to each item, including
    details such as OpenID URIs and dates
●   The sources and queries that were used for a
    particular URI can be found by utilising the query
    plan option
           –   http://bio2rdf.org/queryplan/label/go:0000345
           –   Enables automatic query collaboration, as you could
                take this query plan and add or modify queries
                inside the query plan
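
Judging from the example URI above, the query plan address appears to be the resolved URI with `queryplan/` inserted after the host name. The helper below is a small sketch of that rewriting; the function name is hypothetical and the rule is inferred from this single example.

```python
def query_plan_uri(resolved_uri: str, host: str = "http://bio2rdf.org/") -> str:
    """Insert 'queryplan/' between the host name and the query part."""
    if not resolved_uri.startswith(host):
        raise ValueError("URI does not belong to host " + host)
    return host + "queryplan/" + resolved_uri[len(host):]


plan = query_plan_uri("http://bio2rdf.org/label/go:0000345")
print(plan)  # http://bio2rdf.org/queryplan/label/go:0000345
```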

Conclusion
●   Many large distributed datasources
●   Single interface, RDF
●   Distribute queries efficiently across the endpoints
●   Enable people to publish definitions of what they
    did so that others can collaborate, via a single
    server or by copy and paste


