Arabidopsis Information Portal: A
Community-Extensible Platform
for Open Data
Matt Vaughn
Director, Life Sciences Computing
Texas Advanced Computing Center
University of Texas at Austin
vaughn@tacc.utexas.edu | @mattdotvaughn | www.slideshare.net/mattdotvaughn
The Rationale for Araport
• Loss of TAIR as a publicly funded shared resource
for data mining and basic bioinformatics (plus
technical obsolescence)
• Centralization as a key contributing factor
– Loading of new data into database
– Development of new user experience
– Curation and annotation
– Community support mission
• Araport is designed to be de-centralized and thus
sustainable
Modules Proposed: IAIC
3
‘The design of the AIP will provide core functionality while remaining
flexible to encourage multiple contributors and constant innovation.’
IAIC Whitepaper (2012) Plant Cell: “Taking the next step”.
Modules Realized
4
Modules Realized
5
Core web applications for integration and indexing.
Modules Realized
6
Core web applications for integration and indexing.
In-house Science Apps
Community Science Apps
7
8
Global or faceted search. Will soon extend to community-provided modules.
9
The GMOD JBrowse app lets you select which data tracks to display (left).
An Araport extension, SeqLighter, lets you zoom to sequence (inset).
10
JBrowse users can select additional epigenomics tracks (obtained live from EPIC CoGe).
Users may filter by many attributes (shown: Lab name = Jacobsen Lab).
11
Araport11: Evidence-based re-annotation of A. thaliana Col-0, incorporating 113 public
RNA-seq data sets binned by tissue type. Available in pre-release v3 at araport.org
                     TAIR10   Araport11
Protein coding loci  27,416   28,565     Total
                              1,162      Loci novel to Araport11
                              554        Loci pending resolution
                     5,665    10,946     Loci with splice isoforms
                              21,690     Loci with same functional description
                              6,875      Loci with new functional description
Transcripts          35,385   50,203     Total including isoforms
                              933        Transcripts, CDS altered in Araport11
                              25,079     Transcripts, UTR altered in Araport11
Non-coding RNA       1,359    31,694     Total
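As a quick consistency check on the Araport11 figures above, the loci with the same and with a new functional description should add up to the protein-coding total. This is a minimal sketch: the variable names are illustrative, and the assumption that the two description categories partition the total is inferred from the numbers, not stated on the slide.

```python
# Araport11 counts copied from the table above.
total_protein_coding = 28_565
same_description = 21_690
new_description = 6_875

# Assumption: the two description categories partition the protein-coding loci.
described = same_description + new_description
assert described == total_protein_coding

# Net change relative to TAIR10 (27,416 protein-coding loci).
tair10_protein_coding = 27_416
net_gain = total_protein_coding - tair10_protein_coding
print(f"{described} loci described; net gain over TAIR10: {net_gain}")
```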
12
13
14
ThaleMine gene report pages include…
• Panther Homologs
• Phytozome Homologs
• Physical Interactions via BAR
• Expression Patterns via BAR
• Co-expression via ATTED
Coming soon: genetic interactions via IntAct.
15
ThaleMine gene report pages include…
• NCBI GeneRIFs
• NCBI Publications
• Gene Ontology associations
16
ThaleMine template queries
• Saved query + user parameters
• Display a dynamic table
• Modify the query
17
ThaleMine query results
• Alter query filters
• Save results
• Analyze columns
• Alter display columns
18
ThaleMine results list manipulation
19
20
Predicted phasiRNA sites in Arabidopsis.
Blake Meyers Lab, University of Delaware.
Deepti Vemaraju, Mayumi Nakano.
Arabidopsis citations by year and category.
Nick Provart Lab, University of Toronto.
Asher Pasha, Jamie Waese
Science Apps backed by Web Services
21
Asher Pasha & Nicholas Provart from BAR at University of Toronto.
How It Works
22
https://www.araport.org
JavaScript in the Browser…
if ( gene && gene.length > 0 )
… calls Araport web services by URL…
$.get('https://api.araport.org…
… which in turn call BAR web services by URL…
http://bar.utoronto.ca/webservices/get_expressologs.php
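The three-step chain on this slide (client validates the gene, calls the portal API, which in turn calls the data provider) can be sketched without any network access. Everything here is illustrative: the function names, the stub payload, and the identifier pattern are assumptions, not the Araport or BAR code.

```python
import json
import re

# Arabidopsis locus identifiers (AGI codes) look like AT2G46830.
AGI_PATTERN = re.compile(r"^AT[1-5CM]G\d{5}$", re.IGNORECASE)


def upstream_service(locus):
    """Stand-in for the data provider's web service (hypothetical payload)."""
    return json.dumps({"locus": locus.upper(), "source": "upstream-stub"})


def portal_service(locus, call_upstream=upstream_service):
    """Stand-in for the portal API: forwards the query and relays the JSON."""
    return call_upstream(locus)


def client_request(gene, call_portal=portal_service):
    """Client-side step: validate the input, call the portal, parse the JSON."""
    if not gene or not AGI_PATTERN.match(gene.strip()):
        raise ValueError(f"not an Arabidopsis locus identifier: {gene!r}")
    return json.loads(call_portal(gene.strip()))


print(client_request("at2g46830"))
```

Passing the callables in as parameters keeps each hop replaceable, which mirrors how the portal can sit between a browser app and any provider.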
How It Works
23
https://www.araport.org
The graph is interactive.
• Users can rearrange nodes by dragging.
• Users can get details by clicking.
This is Cytoscape.
• The graph is drawn by Cytoscape.js
• This is a free library for JavaScript.
There are many libraries to choose from!
• jsPhyloSVG: phylogenetic trees
• HighCharts: statistical charts
• jQuery DataTables: interactive tables
• d3.js: all sorts of cool stuff
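Cytoscape.js draws a graph from a list of elements, where each node carries an `id` and each edge carries a `source` and `target`. The sketch below shows one way a service could shape interaction pairs into that form; the function name and the sample pairs are made up for illustration and are not taken from the BAR app.

```python
import json


def to_cytoscape_elements(interactions):
    """Convert (source, target) pairs into a Cytoscape.js-style elements list."""
    # One node per distinct identifier, in a stable order.
    nodes = sorted({name for pair in interactions for name in pair})
    elements = [{"data": {"id": name}} for name in nodes]
    # One edge per interaction pair.
    for source, target in interactions:
        elements.append(
            {"data": {"id": f"{source}-{target}", "source": source, "target": target}}
        )
    return elements


# Illustrative pairs only; these are not real interaction data.
pairs = [("AT2G46830", "AT1G01060"), ("AT2G46830", "AT5G61380")]
print(json.dumps(to_cytoscape_elements(pairs), indent=2))
```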
Code Re-use
24
http://bar.utoronto.ca
The Araport science app (left) reuses code from the pre-existing BAR app (right).
The apps look different by choice but they could be made identical.
https://www.araport.org
Key Points
• BAR Interactions Science App
– Example of visualization module
– Uses the open-source Cytoscape.js JavaScript library
– Displays data from BAR web services via the Araport-mediated
web service API
– Developed at BAR by developers who attended the Araport
Workshop in 2014
– Similar code deployed at Araport and BAR
• We invite you to develop a visualization module
– Araport engineers available to provide technical support
and advice
25
Pure Data Web Services
26
Contributed by SUBA group Nov 2015
Cornelia Hooper & Ian Castleden from University of Western Australia.
Pure Data Web Services
27
Pure Data Web Services
28
at2g46830
Automatic Documentation and User Interface
29
suba3
These are the service endpoints
• The endpoint is the verb in the URL.
• Verb is followed by one or more parameters
• Example: araport/suba/search?locus=AT2G46830
Standard service endpoints at Araport
• /list = which IDs work with this service?
• /search = what are the details for a given ID?
• /prov = who provided this data and how?
• /stats = number of accesses, number of unique users
• /health = results of status check on underlying service
Automatic documentation is generated from simple metadata provided when the
service is created. We translate it to the OpenAPI (née Swagger) specification.
That, in turn, is used to build the UI (and language libraries).
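The endpoint convention above (namespace, service, verb, then parameters) can be sketched as a small path builder. The helper name is hypothetical; only the five verbs and the `araport/suba/search?locus=AT2G46830` example come from the slide.

```python
from urllib.parse import urlencode

# The standard verbs listed on the slide.
STANDARD_ENDPOINTS = ("list", "search", "prov", "stats", "health")


def endpoint_path(namespace, service, endpoint, **params):
    """Build a relative path such as araport/suba/search?locus=AT2G46830."""
    if endpoint not in STANDARD_ENDPOINTS:
        raise ValueError(f"unknown endpoint: {endpoint}")
    path = f"{namespace}/{service}/{endpoint}"
    if params:
        # urlencode escapes parameter values so the query string stays valid.
        path += "?" + urlencode(params)
    return path


print(endpoint_path("araport", "suba", "search", locus="AT2G46830"))
print(endpoint_path("araport", "suba", "health"))
```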
How does it work?
30
suba3
This URL returns this data in JavaScript-friendly JSON format*
This transcription factor localizes to the nucleus
*Not mandatory for Araport APIs but preferred!
Web Service Module
• SUBA provides a web service module to Araport
– URL query takes an Arabidopsis locus as parameter
– URL responds with a “web page” full of data
• The data is not formatted for display to humans (i.e. it is not HTML)
• The data is formatted for JavaScript parsers (in JSON format)
– The service is REST-like in that the data exchange is achieved with just the
standard web protocol, HTTP. This is a modern standard for data exchange.
• We can all use this module!
– Build a Science App that colors genes and pathways
– Build a Science App that scores predicted interactions
– ThaleMine could add subcellular localization to gene lists with ease
• We invite you to develop a web service module
– Araport will provide tech support plus documentation & indexing
– Araport will promote auto-discovery, interoperability
– There’s even provisional support for hosting data that doesn’t have an existing
web presence yet
– The docs and tooling are getting better all the time (but…)
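To illustrate the reuse idea above, for example adding subcellular localization to a gene list, here is a minimal sketch that parses a JSON response and annotates a list of loci. The response layout and field names (`result`, `locus`, `location`) are hypothetical; a real service defines its own.

```python
import json

# Hypothetical response body; a real service defines its own field names.
SAMPLE_RESPONSE = '{"result": [{"locus": "AT2G46830", "location": "nucleus"}]}'


def localization_by_locus(response_text):
    """Parse a JSON response into a {locus: location} lookup table."""
    payload = json.loads(response_text)
    return {row["locus"].upper(): row["location"] for row in payload["result"]}


def annotate_gene_list(genes, lookup):
    """Pair each gene with its localization, or None when there is no record."""
    return [(gene, lookup.get(gene.upper())) for gene in genes]


lookup = localization_by_locus(SAMPLE_RESPONSE)
print(annotate_gene_list(["at2g46830", "AT1G01010"], lookup))
```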
31
suba3
SUBA module was developed
without Araport Staff Intervention
32
suba3
1. SUBA created a web service at their university.
• Added local URLs that return JSON instead of HTML.
• Re-used their existing database and web server.
2. SUBA wrote an Araport adapter to transform their service into a
REST-like API
• Wrote a small program in Python (2 and 3 supported)
• Program calls their URL & prints results in JSON format.
• Added metadata in YAML format (for auto documentation)
• Saved code to a source code repository on Bitbucket.
3. SUBA deployed the adapter to the Araport platform
• Used ‘curl’ to send Araport the URL of the source code repository.
• Araport checks out the code, compiles it, containerizes it, deploys it.
• Araport generates interactive documentation using Swagger.
http://suba.plantenergy.uwa.edu.au/suba-app…
https://bitbucket.org/athaliana/suba-araport
$ curl -kL -X POST -H "$BEARER_TOKEN" \
    -F "git_repository=https://bitbucket.org/athaliana/suba-araport" \
    https://api.araport.org/community/v0.3
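The adapter described in step 2 is a small program that calls the provider's URL and prints the result as JSON. The sketch below shows that shape under stated assumptions: the upstream URL is a placeholder, the `search` function name and the output fields are illustrative, and none of this is the actual SUBA adapter or the platform's required interface.

```python
import json
import sys
from urllib.parse import urlencode
from urllib.request import urlopen

# Placeholder address; substitute the provider's real JSON endpoint.
UPSTREAM_URL = "https://example.org/localization"


def http_get(url):
    """Fetch a URL and return the response body as text."""
    with urlopen(url) as response:
        return response.read().decode("utf-8")


def search(args, fetch=http_get, out=sys.stdout):
    """Query the upstream service for one locus and print one JSON record."""
    locus = args["locus"].upper()
    url = UPSTREAM_URL + "?" + urlencode({"locus": locus})
    upstream = json.loads(fetch(url))
    # Keep only the fields this (hypothetical) adapter chooses to expose.
    record = {"locus": locus, "location": upstream.get("location")}
    out.write(json.dumps(record) + "\n")
    return record
```

Injecting `fetch` makes the adapter testable offline, for example `search({"locus": "at2g46830"}, fetch=lambda url: '{"location": "nucleus"}')`.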
Araport Developer Support: 2016
33
Summary
• The Araport project
– Provides evidence-based annotation for Col-0
– Performs extensive baseline data integration
– Supports the Arabidopsis research community
• The Araport platform
– Enables and hosts modules from community contributors
• Members gain visibility, accessibility, discoverability
• Members benefit from documentation, tech support
• Community Modules can come in multiple forms
– Visualization Science Apps using JavaScript libraries
– Pure data interchange as REST-like Web Services
– Computation Science Apps (analysis code + support for running it)
– JBrowse tracks as RESTful web services
• Technical improvements are ongoing all the time
– Improved developer support and tooling
– Federated search, ontology-based interoperation
– User workspaces, drag & drop combinations
34
Araport Developer Workshops
35
Deploying the Atted Science App Tutorial at
AIP Developer Workshop, TACC, Nov 2014.
The Atted Science App Tutorial is freely
available on GitHub. Other training material
at araport.org/devzone
Sign up to get updates on 2016 workshop
https://www.araport.org/contact
Acknowledgements
36
Araport Data Sources and Module Providers
Acknowledgements
37
J Craig Venter Institute
• Chris Town (PI)
• Jason Miller
• Agnes Chan
• Erik Ferlanti
• Irina Belyaeva
• Chia-Yi Cheng
• Vivek Krishnakumar
Alumni: Konstantinos Krampis,
Svetlana Karamycheva, Maria
Kim, Ben Rosen, Christopher
Nelson, Seth Schobel
University of Cambridge
• Gos Micklem
• Sergio Contrino
Funding Agencies
Texas Advanced Computing Center
• Matt Vaughn
• Josue Balandrano Coronel
• Matt Hanlon
• Rion Dooley
• Joe Stubbs
• Alex Rocha
• John Gentle
Alumni: Walter Moreira, Steve Mock
38
Araport11 Genome Annotation
[Diagram: sources feeding the Araport11 protein-coding gene set. Labels: TAIR10 annotation; 113 public RNA-seq samples from NCBI SRA; PASA, Trinity, BLAST,…; UniProt update; NCBI novel models; Maker novel models.]
https://www.araport.org/data/araport11
Araport11 Pre-release 3 (Dec 2015)
• Available via ThaleMine, JBrowse, FTP, APIs
Categories                          TAIR10   Araport11
Gene Loci
  Protein coding loci               27,416   27,667
  Novel loci in Araport11                    719
  Gene loci with splice isoform     5,665    10,698
Transcripts
  Transcript isoforms               35,385   48,389
  Transcripts altered in Araport11
    CDS altered                              1,191
    UTR altered                              24,185
41
To install a Science App:
📝 Fill out this form.
🕒 Wait a few minutes.
😃 Test the app.
📬 Notify the App Store.
Araport provisions a virtual machine.
Araport obtains the source code.
Araport installs the program.
42
Web Services
Also known as APIs:
• Application Programming Interfaces
Computer programs that…
• Run on a web server.
• Use HTTP for communication.
The query is a URL:
• http://my.url/gene?AT2G46830
The response is a “web page”:
• Format is JSON not HTML.
• Simple to read, simple to parse.
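A minimal sketch of such a web service, using only the Python standard library: the program runs on a web server, speaks HTTP, takes its query from the URL, and answers in JSON. The handler, the response fields, and the throwaway local server are illustrative assumptions, not part of Araport.

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import urlparse
from urllib.request import ProxyHandler, build_opener


class GeneHandler(BaseHTTPRequestHandler):
    """Answer GET /gene?AT2G46830 with a small JSON document."""

    def do_GET(self):
        parsed = urlparse(self.path)
        body = json.dumps({"path": parsed.path, "gene": parsed.query}).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # Keep the example quiet.


def query_once(gene):
    """Start a local server on a free port, send one query, return the parsed JSON."""
    server = HTTPServer(("127.0.0.1", 0), GeneHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    try:
        url = f"http://127.0.0.1:{server.server_port}/gene?{gene}"
        opener = build_opener(ProxyHandler({}))  # Bypass any configured proxy.
        with opener.open(url) as response:
            return json.loads(response.read().decode("utf-8"))
    finally:
        server.shutdown()
        server.server_close()


print(query_once("AT2G46830"))
```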
43
Query by URL
Response in JSON
Araport online documentation
Web Service: SUBA
44
Query by URL
Response in JSON
Araport online documentation
Web Service: KEGG
45
Science Apps
Computer programs that…
• Are hosted on a web server.
• Run in the browser.
• Are written in JavaScript.
Obtain data via…
• web services!
Useful for…
• Interactive science.
• Cool visualizations.
46
Web Service
Computer programs that…
• Run on a web server.
• Use HTTP for communication.
The query is a URL:
• http://my.url/gene?AT2G46830
The response is a “web page”:
• Format is JSON not HTML.
• Simple to read, simple to parse.
Science App
Computer programs that…
• Are hosted on a web server.
• Run in the browser.
• Are written in JavaScript.
Obtain data via…
• web services!
Useful for…
• Interactive science.
• Cool visualizations.
47
JCVI Expression Profile web service (left) and science app (right).
Erik Ferlanti, JCVI senior software engineer.
Web Service Science App
48
KEGG Pathways web service (left) and science app (right).
Brian Liu, intern at JCVI.
Web Service Science App
49
PhosPhAt Phosphorylation web service (left) and science app (right).
Ismail Liban, intern at JCVI.
Web Service Science App
Araport Developer Workshops
50
Deploying the Atted Science App Tutorial at
AIP Developer Workshop, TACC, Nov 2014.
The Atted Science App Tutorial is available
as open source on GitHub.
Next workshop: Winter 2015
Acknowledgements
J Craig Venter Institute
• Chris Town
• Jason Miller
• Agnes Chan
• Maria Kim
• Erik Ferlanti
• Seth Schobel
• Irina Belyaeva
• Chia-Yi Cheng
• Vivek Krishnakumar
Former members
• Ben Rosen
• Christopher Nelson
• Konstantinos Krampis
• Svetlana Karamycheva
University of Cambridge
• Gos Micklem
• Sergio Contrino
Texas Advanced Computing Center
• Matt Vaughn
• Steve Mock
• Rion Dooley
• Matt Hanlon
• Joe Stubbs
• Walter Moreira
• Chris Jordan
51
Funding Agencies
Data Sources
Araport User Workspaces
• Status
– Prototype available in 2015
• Grid layout (user adds rows or columns)
• User adds Science Apps to grid (app isolation is goal)
– Coming soon
• Drag and drop
• Communications bus (BLAST app sends results to viz app)
– Coming later
• Automatic discovery (BLAST app finds my alignment app)
• Shared workspaces
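The planned communications bus, where one app publishes results and another consumes them, follows a publish/subscribe pattern. Below is a minimal in-process sketch of that pattern; the class, topic name, and message shape are hypothetical and do not describe the Araport workspace implementation.

```python
from collections import defaultdict


class MessageBus:
    """Tiny publish/subscribe bus: apps subscribe to topics and receive messages."""

    def __init__(self):
        self._subscribers = defaultdict(list)

    def subscribe(self, topic, handler):
        """Register a callable to be invoked for every message on a topic."""
        self._subscribers[topic].append(handler)

    def publish(self, topic, message):
        """Deliver a message to all subscribers; return how many received it."""
        handlers = self._subscribers.get(topic, [])
        for handler in handlers:
            handler(message)
        return len(handlers)


# A "viz app" listens for results that a "BLAST app" publishes.
received = []
bus = MessageBus()
bus.subscribe("blast.results", received.append)
bus.publish("blast.results", {"query": "AT2G46830", "hits": 3})
print(received)
```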
52
Infrastructure Challenges 1
• Federated search
– Prototype: Single search returns results from
• The Araport content management system (Drupal)
• The Araport data warehouse (ThaleMine, Lucene)
• The Araport genome browser (JBrowse metadata)
– Goal during development:
• Extend search to 3rd party indexes (NCBI, EBI, etc)
• Develop web services APIs for distributed indexes
• Implement rapid response distributed search
• Automatic discovery
– Araport components to discover each other at run time
– User sees available options based on current results
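A federated search of the kind prototyped above sends one query to several indexes and merges the answers. The sketch below assumes each index is a callable returning hits with a comparable `score`; those names and the scoring are illustrative assumptions, since the slide does not say how results are ranked.

```python
def federated_search(term, indexes):
    """Query every index with the same term and merge hits, best score first."""
    merged = []
    for source, search in indexes.items():
        for hit in search(term):
            # Tag each hit with the index it came from.
            merged.append({"source": source, **hit})
    return sorted(merged, key=lambda hit: hit["score"], reverse=True)


# Stub indexes standing in for a CMS, a data warehouse, and a genome browser.
indexes = {
    "cms": lambda term: [{"title": f"News about {term}", "score": 0.4}],
    "warehouse": lambda term: [{"title": f"Gene {term}", "score": 0.9}],
    "browser": lambda term: [],
}
for hit in federated_search("AT2G46830", indexes):
    print(hit)
```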
53
Infrastructure Challenges 2
• Interoperable web services
– Prototype: support controlled vocabularies
• Sequence Ontology (SO) for data organization & display
• Gene Ontology (GO) associations for display and search
• Science apps integrate diverse web services
– Goals for development:
• Ontologies for phenotype, reactome, metabolome
• Community-driven adoption of controlled vocabularies
• Web service integration with snap-in ease
• Redundant web services
– Establish web services equivalence classes
– Automatic fail over should primary provider fail
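Automatic failover across equivalent services can be sketched as trying each provider in order until one answers. The provider names and callables here are hypothetical stand-ins for services in one equivalence class.

```python
def call_with_failover(providers, query):
    """Try each (name, callable) in order; return the first successful answer."""
    errors = []
    for name, provider in providers:
        try:
            return name, provider(query)
        except Exception as exc:  # Any failure moves on to the next provider.
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


def primary(query):
    raise ConnectionError("primary is down")


def mirror(query):
    return {"locus": query, "status": "ok"}


print(call_with_failover([("primary", primary), ("mirror", mirror)], "AT2G46830"))
```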
54
Infrastructure Challenges 3
• Provenance
– Prototype:
• Submitters provide web services metadata for auto display
• Submitters may provide an “About” page on Science Apps
• Web services logs show number of users, number of hits
– Goals for development:
• Automatic provenance display on every submission
• Monthly reporting to contributors (e.g. Google Analytics)
• Standards compliance e.g. W3C PROV spec
• Community adoption
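The usage figures mentioned in the prototype (number of users, number of hits) can be derived from an access log. A minimal sketch, assuming a log of `(service, user)` pairs; that log format is an assumption made for illustration.

```python
from collections import Counter


def usage_stats(access_log):
    """Summarize (service, user) log entries into hits and unique users per service."""
    hits = Counter(service for service, _user in access_log)
    users = {}
    for service, user in access_log:
        users.setdefault(service, set()).add(user)
    return {
        service: {"hits": hits[service], "unique_users": len(users[service])}
        for service in hits
    }


log = [("suba", "alice"), ("suba", "bob"), ("suba", "alice"), ("kegg", "bob")]
print(usage_stats(log))
```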
55
Web Design for Dynamic Pages
[Diagram: browser and server exchanging data over HTTP, shown for both models. Labels: Server, Browser, JS, CSS, DB, HTML <form>, CGI, HTML <table>, URL, web services, JavaScript <table>, HTML3, CSS3, HTML5.]
Traditional
Active server, static client.
Submit one form, display one result.
Server provides data and its format.
Modern
Active client, dynamic pages.
Continual client/server interaction.
Server provides data, client formats it.
56
Requisite Architectural Diagram
[Diagram labels: Portal programs (www.araport.org): ThaleMine, JBrowse, Science Apps. External programs. API (api.araport.org): authentication, metering, logging, versioning, security. Agave Core: keep metadata, enroll users; apps, jobs, systems; computing, storage, databases. ADAMA: format data, enroll services; services a to f; back ends: CGI, SOAP, REST, InterMines, Tripal, others.]
57
Abstract
• The Araport platform for scalable information exchange in genomics.
• The Arabidopsis Information Portal (Araport) is a web resource for genome science.
Araport is a new and free service centered on Arabidopsis thaliana, the plant whose
genome sequence serves as a model for all of plant biology. Araport integrates data
from major sources including NCBI, UniProt, PubMed, TAIR, BAR, EPIC CoGe, IntAct,
Atted II, KEGG, and the 1001 Genomes Project. Araport also exposes its own
“Araport11” update to the organism’s structural and functional gene annotation.
Araport was conceived as a new kind of model organism database, one that could keep
pace with ever-growing data sets while not burdening funding agencies with an ever-
growing data warehouse. Araport is a platform for data sharing, data integration, and
data federation. Araport provides means for scientists in the community to develop and
deploy web services that expose data residing elsewhere on the internet. Araport
provides means for scientists to develop and deploy “Science Apps” that can perform
computational analysis and visualization of distributed data. Araport already hosts over
20 Science Apps and almost 100 web services linked to a dozen data sources. Currently
shifting from prototype to development mode, Araport provides a model for
sustainable growth of model organism community resources. Primarily an information
science project, Araport takes on scalability challenges related to real-time integration
of distributed services, interoperability between diverse services, indexing for federated
search, reliability & responsiveness, security & logging, open-source development for
software portability, and usability through automated documentation.
58

More Related Content

What's hot

A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of AraportAraport
 
ICAR 2015 Workshop - Nick Provart
ICAR 2015 Workshop - Nick ProvartICAR 2015 Workshop - Nick Provart
ICAR 2015 Workshop - Nick ProvartAraport
 
NISO's IOTA OpenURL Quality Initiative @ ALA & SLA 2012
NISO's IOTA OpenURL Quality Initiative @ ALA & SLA 2012NISO's IOTA OpenURL Quality Initiative @ ALA & SLA 2012
NISO's IOTA OpenURL Quality Initiative @ ALA & SLA 2012Rafal Kasprowski
 
ICAR 2015 Poster - Araport
ICAR 2015 Poster - AraportICAR 2015 Poster - Araport
ICAR 2015 Poster - AraportAraport
 
EDS Web-scale Panel (Preprint), 2012 Charleston Conference
EDS Web-scale Panel (Preprint), 2012 Charleston ConferenceEDS Web-scale Panel (Preprint), 2012 Charleston Conference
EDS Web-scale Panel (Preprint), 2012 Charleston ConferenceRafal Kasprowski
 
IOTA @ NASIG 2011: Measuring the Quality of OpenURL Links
IOTA @ NASIG 2011: Measuring the Quality of OpenURL LinksIOTA @ NASIG 2011: Measuring the Quality of OpenURL Links
IOTA @ NASIG 2011: Measuring the Quality of OpenURL LinksRafal Kasprowski
 
Introducing ProtAnnot - Araport workshop at PAG 2016
Introducing ProtAnnot - Araport workshop at PAG 2016Introducing ProtAnnot - Araport workshop at PAG 2016
Introducing ProtAnnot - Araport workshop at PAG 2016Ann Loraine
 
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A ReviewIRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A ReviewIRJET Journal
 
ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...
ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...
ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...Michael Cummings
 
Data management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK StoryData management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK StoryCarole Goble
 
Web scale discovery vs google scholar
Web scale discovery vs google scholarWeb scale discovery vs google scholar
Web scale discovery vs google scholarNikesh Narayanan
 
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, Japan
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, JapanISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, Japan
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, JapanPhilippe Rocca-Serra
 
Emerging technologies in academic libraries
Emerging technologies in academic librariesEmerging technologies in academic libraries
Emerging technologies in academic librariesMichael Cummings
 
Rhea: Adaptively Sampling Authoritative Content from Social Activity Streams
Rhea: Adaptively Sampling Authoritative Content from Social Activity StreamsRhea: Adaptively Sampling Authoritative Content from Social Activity Streams
Rhea: Adaptively Sampling Authoritative Content from Social Activity StreamsPanagiotis Liakos
 
REST APIs for the Internet of Things
REST APIs for the Internet of ThingsREST APIs for the Internet of Things
REST APIs for the Internet of ThingsMichael Koster
 
Developing Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through AraportDeveloping Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through AraportMatthew Vaughn
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
 
Research Object Community Update
Research Object Community UpdateResearch Object Community Update
Research Object Community UpdateCarole Goble
 
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011sspeiser
 

What's hot (20)

A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of Araport
 
ICAR 2015 Workshop - Nick Provart
ICAR 2015 Workshop - Nick ProvartICAR 2015 Workshop - Nick Provart
ICAR 2015 Workshop - Nick Provart
 
NISO's IOTA OpenURL Quality Initiative @ ALA & SLA 2012
NISO's IOTA OpenURL Quality Initiative @ ALA & SLA 2012NISO's IOTA OpenURL Quality Initiative @ ALA & SLA 2012
NISO's IOTA OpenURL Quality Initiative @ ALA & SLA 2012
 
ICAR 2015 Poster - Araport
ICAR 2015 Poster - AraportICAR 2015 Poster - Araport
ICAR 2015 Poster - Araport
 
EDS Web-scale Panel (Preprint), 2012 Charleston Conference
EDS Web-scale Panel (Preprint), 2012 Charleston ConferenceEDS Web-scale Panel (Preprint), 2012 Charleston Conference
EDS Web-scale Panel (Preprint), 2012 Charleston Conference
 
IOTA @ NASIG 2011: Measuring the Quality of OpenURL Links
IOTA @ NASIG 2011: Measuring the Quality of OpenURL LinksIOTA @ NASIG 2011: Measuring the Quality of OpenURL Links
IOTA @ NASIG 2011: Measuring the Quality of OpenURL Links
 
Introducing ProtAnnot - Araport workshop at PAG 2016
Introducing ProtAnnot - Araport workshop at PAG 2016Introducing ProtAnnot - Araport workshop at PAG 2016
Introducing ProtAnnot - Araport workshop at PAG 2016
 
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A ReviewIRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
 
Echoes Project
Echoes ProjectEchoes Project
Echoes Project
 
ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...
ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...
ELUNA2013:Providing Voyager catalog data in a custom, open source web applica...
 
Data management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK StoryData management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK Story
 
Web scale discovery vs google scholar
Web scale discovery vs google scholarWeb scale discovery vs google scholar
Web scale discovery vs google scholar
 
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, Japan
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, JapanISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, Japan
ISA-Tab Standards at Metabolomics Society Meeting, Tsuruoka 2014, Japan
 
Emerging technologies in academic libraries
Emerging technologies in academic librariesEmerging technologies in academic libraries
Emerging technologies in academic libraries
 
Rhea: Adaptively Sampling Authoritative Content from Social Activity Streams
Rhea: Adaptively Sampling Authoritative Content from Social Activity StreamsRhea: Adaptively Sampling Authoritative Content from Social Activity Streams
Rhea: Adaptively Sampling Authoritative Content from Social Activity Streams
 
REST APIs for the Internet of Things
REST APIs for the Internet of ThingsREST APIs for the Internet of Things
REST APIs for the Internet of Things
 
Developing Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through AraportDeveloping Apps: Exposing Your Data Through Araport
Developing Apps: Exposing Your Data Through Araport
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
Research Object Community Update
Research Object Community UpdateResearch Object Community Update
Research Object Community Update
 
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
Linked APIs for Life Sciences Tutorial at SWAT4LS 3011
 

Similar to Arabidopsis Information Portal: A Community-Extensible Platform for Open Data

Tripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIIITripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIIIVivek Krishnakumar
 
Arabidopsis Information Portal overview from Plant Biology Europe 2014
Arabidopsis Information Portal overview from Plant Biology Europe 2014Arabidopsis Information Portal overview from Plant Biology Europe 2014
Arabidopsis Information Portal overview from Plant Biology Europe 2014Matthew Vaughn
 
aip_developer_overview_icar_2014
aip_developer_overview_icar_2014aip_developer_overview_icar_2014
aip_developer_overview_icar_2014Matthew Vaughn
 
Arabidopsis Information Portal, Developer Workshop 2014, Introduction
Arabidopsis Information Portal, Developer Workshop 2014, IntroductionArabidopsis Information Portal, Developer Workshop 2014, Introduction
Arabidopsis Information Portal, Developer Workshop 2014, IntroductionJasonRafeMiller
 
ICAR 2015 Workshop - Matt Vaughn
ICAR 2015 Workshop - Matt VaughnICAR 2015 Workshop - Matt Vaughn
ICAR 2015 Workshop - Matt VaughnAraport
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04nihshowandtell
 
Vaughn aip walkthru_pag2015
Vaughn aip walkthru_pag2015Vaughn aip walkthru_pag2015
Vaughn aip walkthru_pag2015Araport
 
aip-workshop1-dev-tutorial
aip-workshop1-dev-tutorialaip-workshop1-dev-tutorial
aip-workshop1-dev-tutorialMatthew Vaughn
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04David Phillips
 
Linked Services for the Web of Data
Linked Services for the Web of DataLinked Services for the Web of Data
Linked Services for the Web of DataCarlos Pedrinaci
 
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...
FAIR Software (and Data) Citation: Europe, Research Object Systems, Networks ...Carole Goble
 
Tripal v3, the Collaborative Online Database Platform Supporting an Internati...
Tripal v3, the Collaborative Online Database Platform Supporting an Internati...Tripal v3, the Collaborative Online Database Platform Supporting an Internati...
Tripal v3, the Collaborative Online Database Platform Supporting an Internati...Bradford Condon
 
A BASILar Approach for Building Web APIs on top of SPARQL Endpoints
A BASILar Approach for Building Web APIs on top of SPARQL EndpointsA BASILar Approach for Building Web APIs on top of SPARQL Endpoints
A BASILar Approach for Building Web APIs on top of SPARQL EndpointsEnrico Daga
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Linking Scientific Instruments and Computation
Linking Scientific Instruments and ComputationLinking Scientific Instruments and Computation
Linking Scientific Instruments and ComputationIan Foster
 
aip-developer-intro_pag2015
aip-developer-intro_pag2015aip-developer-intro_pag2015
aip-developer-intro_pag2015Matthew Vaughn
 

Similar to Arabidopsis Information Portal: A Community-Extensible Platform for Open Data (20)

Tripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIIITripal within the Arabidopsis Information Portal - PAG XXIII
Tripal within the Arabidopsis Information Portal - PAG XXIII
 
Arabidopsis Information Portal overview from Plant Biology Europe 2014
Arabidopsis Information Portal overview from Plant Biology Europe 2014Arabidopsis Information Portal overview from Plant Biology Europe 2014
Arabidopsis Information Portal overview from Plant Biology Europe 2014
 
aip_developer_overview_icar_2014
aip_developer_overview_icar_2014aip_developer_overview_icar_2014
aip_developer_overview_icar_2014
 
Arabidopsis Information Portal, Developer Workshop 2014, Introduction
Arabidopsis Information Portal, Developer Workshop 2014, IntroductionArabidopsis Information Portal, Developer Workshop 2014, Introduction
Arabidopsis Information Portal, Developer Workshop 2014, Introduction
 
ICAR 2015 Workshop - Matt Vaughn
ICAR 2015 Workshop - Matt VaughnICAR 2015 Workshop - Matt Vaughn
ICAR 2015 Workshop - Matt Vaughn
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04
 
Vaughn aip walkthru_pag2015
Vaughn aip walkthru_pag2015Vaughn aip walkthru_pag2015
Vaughn aip walkthru_pag2015
 
aip-workshop1-dev-tutorial
aip-workshop1-dev-tutorialaip-workshop1-dev-tutorial
aip-workshop1-dev-tutorial
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04
 
Linked Services for the Web of Data
Linked Services for the Web of DataLinked Services for the Web of Data
Arabidopsis Information Portal: A Community-Extensible Platform for Open Data

  • 1. Arabidopsis Information Portal: A Community-Extensible Platform for Open Data Matt Vaughn Director, Life Sciences Computing Texas Advanced Computing Center University of Texas at Austin vaughn@tacc.utexas.edu | @mattdotvaughn | www.slideshare.net/mattdotvaughn
  • 2. The Rationale for Araport • Loss of TAIR as a publicly funded shared resource for data mining and basic bioinformatics (plus technical obsolescence) • Centralization as a key contributing factor – Loading of new data into database – Development of new user experience – Curation and annotation – Community support mission • Araport is designed to be de-centralized and thus sustainable
  • 3. Modules Proposed: IAIC. ‘The design of the AIP will provide core functionality while remaining flexible to encourage multiple contributors and constant innovation.’ IAIC Whitepaper (2012), Plant Cell: “Taking the next step”.
  • 5. Modules Realized 5 Core web applications for integration and indexing.
  • 6. Modules Realized 6 Core web applications for integration and indexing. In-house Science Apps Community Science Apps
  • 8. 8 Global or faceted search. Will soon extend to community-provided modules.
  • 9. 9 The GMOD JBrowse app lets you select which data tracks to display (left). An Araport extension, SeqLighter, lets you zoom to sequence (inset).
  • 10. 10 JBrowse users can select additional epigenomics tracks (obtained live from EPIC CoGe). User may filter by many attributes (shown: Lab name = Jacobsen Lab).
  • 11. Araport 11: Evidence-based re-annotation of A. thaliana Col-0, incorporating 113 public RNAseq data sets binned by tissue type. Available in pre-release v3 at araport.org. Counts (TAIR10 / Araport11): • Protein coding loci: 27,416 / 28,565 total – 1,162 loci novel to Araport 11 – 554 loci pending resolution – 5,665 / 10,946 loci with splice isoforms – 21,690 loci with same functional description – 6,875 loci with new functional description • Transcripts: 35,385 / 50,203 total including isoforms – 933 transcripts, CDS altered in Araport 11 – 25,079 transcripts, UTR altered in Araport 11 • Non-coding RNA: 1,359 / 31,694 total
  • 14. ThaleMine gene report pages include… Panther Homologs; Phytozome Homologs; Physical Interactions via BAR (coming soon: genetic interactions via IntAct); Expression Patterns via BAR; Co-expression via ATTED
  • 15. ThaleMine gene report pages include… NCBI GeneRIFs; NCBI Publications; Gene Ontology associations
  • 16. ThaleMine template queries: saved query + user parameters; display a dynamic table; modify the query
  • 20. Science Apps backed by Web Services. Predicted phasiRNA sites in Arabidopsis: Blake Meyers Lab, University of Delaware (Deepti Vemaraju, Mayumi Nakano). Arabidopsis citations by year and category: Nick Provart Lab, University of Toronto (Asher Pasha, Jamie Waese).
  • 21. 21 Asher Pasha & Nicholas Provart from BAR at University of Toronto.
  • 22. How It Works (https://www.araport.org) JavaScript in the browser… if ( gene && gene.length > 0 ) … calls Araport web services by URL… $.get('https://api.araport.org… … which in turn call BAR web services by URL… http://bar.utoronto.ca/webservices/get_expressologs.php
  • 23. How It Works 23 https://www.araport.org The graph is interactive. • Users can rearrange nodes by dragging. • Users can get details by clicking. This is Cytoscape. • The graph is drawn by Cytoscape.js • This is a free library for JavaScript. There are many libraries to choose from! • jsPhyloSVG: phylogenetic trees • HighCharts: statistical charts • jQuery DataTables: interactive tables • d3.js: all sorts of cool stuff
  • 24. Code Re-use 24 http://bar.utoronto.ca The Araport science app (left) reuses code from the pre-existing BAR app (right). The apps look different by choice but they could be made identical. https://www.araport.org
  • 25. Key Points • BAR Interactions Science App – Example of a visualization module – Uses the open-source Cytoscape JavaScript library – Displays data from BAR web services via an Araport-mediated web service API – Developed at BAR by developers who attended the Araport Workshop in 2014 – Similar code deployed at Araport and BAR • We invite you to develop a visualization module – Araport engineers are available to provide technical support and advice
  • 26. Pure Data Web Services 26 Contributed by SUBA group Nov 2015 Cornelia Hooper & Ian Castleden from University of Western Australia.
  • 27. Pure Data Web Services 27
  • 28. Pure Data Web Services 28 at2g46830
  • 29. Automatic Documentation and User Interface (suba3) These are the service endpoints: • The endpoint is the verb in the URL. • The verb is followed by one or more parameters. • Example: araport/suba/search?locus=AT2G46830 Standard service endpoints at Araport: • /list = which IDs work with this service? • /search = what are the details for a given ID? • /prov = who provided this data and how? • /stats = number of accesses, number of unique users • /health = results of status check on underlying service Automatic documentation is generated from simple metadata provided when the service is created. We translate it to the OpenAPI (née Swagger) spec for API interfaces. That, in turn, is used to build the UI (and language libraries).
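The endpoint convention on slide 29 (a verb in the URL, followed by parameters) can be illustrated with a small URL builder. This is only a sketch: the function name `endpoint_url` and the base URL `https://api.example.org` are placeholders invented for the example, not part of the Araport API.

```python
from urllib.parse import urlencode

# The five standard endpoint verbs listed on the slide.
STANDARD_ENDPOINTS = ("list", "search", "prov", "stats", "health")


def endpoint_url(base, namespace, service, verb, **params):
    """Build a URL of the form base/namespace/service/verb?key=value."""
    if verb not in STANDARD_ENDPOINTS:
        raise ValueError(f"unknown endpoint verb: {verb!r}")
    url = "/".join([base.rstrip("/"), namespace, service, verb])
    if params:
        # Sort parameters so the generated URL is deterministic.
        url += "?" + urlencode(sorted(params.items()))
    return url


# Hypothetical base URL, used only for illustration.
print(endpoint_url("https://api.example.org", "araport", "suba",
                   "search", locus="AT2G46830"))
```

The path it produces ends in `araport/suba/search?locus=AT2G46830`, matching the example on the slide.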
  • 30. How does it work? 30 suba3 This URL Returns this data Javascript-friendly JSON format* This transcription factor localizes to nucleus *Not mandatory for Araport APIs but preferred!
  • 31. Web Service Module • SUBA provides a web service module to Araport – URL query takes an Arabidopsis locus as parameter – URL responds with a web page full of data • The data is not formatted for display to humans (e.g. HTML) • The data is formatted for Javascript parsers (in JSON format) – The service is REST-like in that the data exchange is achieved with just the standard web protocol, HTTP. This is a modern standard for data exchange. • We can all use this module! – Build a Science App that colors genes and pathway – Build a Science App that scores predicted interactions – ThaleMine could add subcellular localization to gene lists with ease • We invite you to develop a web service module – Araport will provide tech support plus documentation & indexing – Araport will promote auto-discovery, interoperability – There’s even provisional support for hosting data that doesn’t have an existing web presence yet – The docs and tooling are getting better all the time (but…) 31 suba3
  • 32. SUBA module was developed without Araport staff intervention (suba3) 1. SUBA created a web service at their university. • Added local URLs that return JSON instead of HTML. • Re-used their existing database and web server. 2. SUBA wrote an Araport adapter to transform their service into a REST-like API. • Wrote a small program in Python (2 and 3 supported). • Program calls their URL & prints results in JSON format. • Added metadata in YAML format (for auto documentation). • Saved code to a source code repository on Bitbucket. 3. SUBA deployed the adapter to the Araport platform. • Used ‘curl’ to send Araport the URL of the source code repository. • Araport checks out the code, compiles it, containerizes it, deploys it. • Araport generates interactive documentation using Swagger. http://suba.plantenergy.uwa.edu.au/suba-app… https://bitbucket.org/athaliana/suba-araport $ curl -kL -X POST -H "$BEARER_TOKEN" -F "git_repository=https://bitbucket.org/athaliana/suba-araport" https://api.araport.org/community/v0.3
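Step 2 on slide 32 describes the adapter as a small Python program that calls the provider's URL and prints the result as JSON. A minimal sketch of that idea follows. The function name `search`, the injected `fetch` callable, the upstream URL, and the canned payload are all assumptions made for illustration; they are not the actual SUBA adapter or the Araport adapter interface.

```python
import json


def search(args, fetch):
    """Look up one locus upstream and print the result as JSON.

    `args` is a dict of query parameters. `fetch` is any callable that takes
    a URL and returns the response body as text; it is injected here so the
    sketch runs without network access.
    """
    locus = args["locus"].upper()
    # Hypothetical upstream URL; a real adapter would call the provider's service.
    url = "https://provider.example.org/lookup?locus=" + locus
    record = json.loads(fetch(url))
    print(json.dumps({"locus": locus, "result": record}, sort_keys=True))


def fake_fetch(url):
    # Stand-in for an HTTP GET; returns a canned, made-up JSON body.
    return '{"location": "nucleus"}'


search({"locus": "at2g46830"}, fake_fetch)
```

Passing the fetch function in as a parameter keeps the data transformation separate from the network call, so the adapter logic can be tested offline.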
  • 34. Summary • The Araport project – Provides evidence-based annotation for Col-0 – Performs extensive baseline data integration – Supports the Arabidopsis research community • The Araport platform – Enables and hosts modules from community contributors • Members gain visibility, accessibility, discoverability • Members benefit from documentation, tech support • Community Modules can come in multiple forms – Visualization Science Apps using JavaScript libraries – Pure data interchange as REST-like Web Services – Computation Science Apps (analysis code + support for running it) – JBrowse tracks as RESTful web services • Technical improvements are ongoing all the time – Improved developer support and tooling – Federated search, ontology-based interoperation – User workspaces, drag & drop combinations 34
  • 35. Araport Developer Workshops 35 Deploying the Atted Science App Tutorial at AIP Developer Workshop, TACC, Nov 2014. The Atted Science App Tutorial is freely available on GitHub. Other training material at araport.org/devzone Sign up to get updates on 2016 workshop https://www.araport.org/contact
  • 37. Acknowledgements 37 J Craig Venter Institute • Chris Town (PI) • Jason Miller • Agnes Chan • Erik Ferlanti • Irina Belyaeva • Chia-Yi Cheng • Vivek Krishnakumar Alumni: Konstantinos Krampis, Svetlana Karamycheva, Maria Kim, Ben Rosen, Christopher Nelson, Seth Schobel University of Cambridge • Gos Micklem • Sergio Contrino Funding Agencies Texas Advanced Computing Center • Matt Vaughn • Josue Balandrano Coronel • Matt Hanlon • Rion Dooley • Joe Stubbs • Alex Rocha • John Gentle Alumni: Walter Moreira, Steve Mock
  • 39. Araport11 Genome Annotation Araport11 Protein Coding Genes UniProt Update NCBI Novel Models Maker Novel Models NCBI SRA RNA-seq PASA, Trinity, BLAST,… https://www.araport.org/data/araport11 TAIR10 Annotation 113 public RNAseq samples
  • 40. Araport11 Pre-release 3 (Dec 2015) • Available via ThaleMine, JBrowse, FTP, APIs • Counts by category (TAIR10 / Araport11): – Gene loci: protein coding loci 27,416 / 27,667; novel loci in Araport11 719; gene loci with splice isoform 5,665 / 10,698 – Transcripts: transcript isoforms 35,385 / 48,389 – Transcripts altered in Araport11: CDS altered 1,191; UTR altered 24,185
  • 41. 41 To install a Science App: 📝 Fill out this form. 🕒 Wait a few minutes. 😃 Test the app. 📬 Notify the App Store. Araport provisions a virtual machine. Araport obtains the source code. Araport installs the program.
  • 42. 42 Web Services Also known as APIs: • Application Programmer Interfaces Computer programs that… • Run on a web server. • Use HTTP for communication. The query is a URL: • http://my.url/gene?AT2G46830 The response is a “web page”: • Format is JSON not HTML. • Simple to read, simple to parse.
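The query/response pattern on slide 42 (a URL goes in, JSON comes back) can be sketched with the Python standard library alone. The query URL is the example from the slide; the response body and its field names are made up for illustration, and `describe` is a hypothetical helper name.

```python
import json
from urllib.parse import urlsplit


def describe(query_url, response_body):
    """Split a query URL into its parts and list the fields of a JSON reply."""
    parts = urlsplit(query_url)
    data = json.loads(response_body)
    return {
        "host": parts.netloc,
        "path": parts.path,
        "query": parts.query,
        "fields": sorted(data),
    }


# The URL is the slide's example; the JSON body is a made-up stand-in.
summary = describe("http://my.url/gene?AT2G46830",
                   '{"locus": "AT2G46830", "note": "example"}')
print(summary)
```

Because the reply is JSON rather than HTML, one call to `json.loads` is enough to turn it into ordinary dictionaries and lists.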
  • 43. Web Service: SUBA. Query by URL; response in JSON; Araport online documentation.
  • 44. Web Service: KEGG. Query by URL; response in JSON; Araport online documentation.
  • 45. 45 Science Apps Computer programs that… • Hosted on a web server. • Run in the browser. • Written in JavaScript. Obtains data by… • web services! Useful for… • Interactive science. • Cool visualizations.
  • 46. 46 Web Service Science App Computer programs that… • Hosted on a web server. • Run in the browser. • Written in JavaScript. Obtains data by… • web services! Useful for… • Interactive science. • Cool visualizations. Computer programs that… • Run on a web server. • Use HTTP for communication. The query is a URL: • http://my.url/gene?AT2G46830 The response is a “web page”: • Format is JSON not HTML. • Simple to read, simple to parse.
  • 47. 47 JCVI Expression Profile web service (left) and science app (right). Erik Ferlanti, JCVI senior software engineer. Web Service Science App
  • 48. 48 KEGG Pathways web service (left) and science app (right). Brian Liu, intern at JCVI. Web Service Science App
  • 49. 49 PhosPhAt Phosphorylation web service (left) and science app (right). Ismail Liban, intern at JCVI. Web Service Science App
  • 50. Araport Developer Workshops 50 Deploying the Atted Science App Tutorial at AIP Developer Workshop, TACC, Nov 2014. The Atted Science App Tutorial is available as open source on GitHub. Next workshop: Winter 2015
  • 51. Acknowledgements J Craig Venter Institute • Chris Town • Jason Miller • Agnes Chan • Maria Kim • Erik Ferlanti • Seth Schobel • Irina Belyaeva • Chia-Yi Cheng • Vivek Krishnakumar Former members • Ben Rosen • Christopher Nelson • Konstantinos Krampis • Svetlana Karamycheva University of Cambridge • Gos Micklem • Sergio Contrino Texas Advanced Computing Center • Matt Vaughn • Steve Mock • Rion Dooley • Matt Hanlon • Joe Stubbs • Walter Moreira • Chris Jordan 51 Funding Agencies Data Sources
  • 52. Araport User Workspaces • Status – Prototype available in 2015 • Grid layout (user adds rows or columns) • User adds Science Apps to grid (app isolation is the goal) – Coming soon • Drag and drop • Communications bus (blast app sends results to viz app) – Coming later • Automatic discovery (blast app finds my alignment app) • Shared workspaces
  • 53. Infrastructure Challenges 1 • Federated search – Prototype: Single search returns results from • The Araport content management system (Drupal) • The Araport data warehouse (ThaleMine, Lucene) • The Araport genome browser (JBrowse metadata) – Goal during development: • Extend search to 3rd party indexes (NCBI, EBI, etc) • Develop web services APIs for distributed indexes • Implement rapid response distributed search • Automatic discovery – Araport components to discover each other at run time – User sees available options based on current results 53
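The federated search prototype on slide 53 sends one search to several indexes and returns a combined result. A toy sketch of that pattern is below. The in-memory indexes, their contents, and the names `make_index` and `federated_search` are invented for illustration and do not reflect how Araport implements search.

```python
def make_index(documents):
    """Return a toy search function over an in-memory list of strings."""
    def search(term):
        return [doc for doc in documents if term.lower() in doc.lower()]
    return search


def federated_search(term, sources):
    """Send one term to every source and tag each hit with the source name."""
    results = []
    for name, search in sources.items():
        for hit in search(term):
            results.append({"source": name, "hit": hit})
    return results


# Stand-ins for the three prototype indexes; the documents are made up.
sources = {
    "cms": make_index(["Araport11 release notes", "Workshop announcement"]),
    "thalemine": make_index(["Gene AT2G46830", "Gene AT1G01010"]),
    "jbrowse": make_index(["Track: Araport11 protein coding genes"]),
}

for row in federated_search("araport11", sources):
    print(row["source"], "->", row["hit"])
```

Tagging every hit with its source is what lets a single results page show where each match came from. Extending the search to a third-party index would then mean adding one more entry to `sources`.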
  • 54. Infrastructure Challenges 2 • Interoperable web services – Prototype: support controlled vocabularies • Sequence Ontology (SO) for data organization & display • Gene Ontology (GO) associations for display and search • Science apps integrate diverse web services – Goals for development: • Ontologies for phenotype, reactome, metabolome • Community-driven adoption of controlled vocabularies • Web service integration with snap-in ease • Redundant web services – Establish web service equivalence classes – Automatic failover should the primary provider fail
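The "redundant web services" goal on slide 54 amounts to treating several providers as interchangeable and falling back when the first one fails. The sketch below shows one simple way to express that. The provider functions and the name `call_with_failover` are hypothetical, and the deck does not say how Araport intends to implement failover.

```python
def call_with_failover(providers, locus):
    """Try each equivalent provider in order; return the first answer obtained."""
    errors = []
    for name, provider in providers:
        try:
            return name, provider(locus)
        except Exception as exc:  # any failure moves on to the next provider
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


def primary(locus):
    # Simulated outage of the primary provider.
    raise ConnectionError("primary provider is down")


def mirror(locus):
    # Simulated equivalent service that is still reachable.
    return {"locus": locus, "served_by": "mirror"}


name, answer = call_with_failover([("primary", primary), ("mirror", mirror)],
                                  "AT2G46830")
print(name, answer)
```

The ordered list of providers plays the role of an equivalence class: any member may answer, and the caller only learns which one did.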
  • 55. Infrastructure Challenges 3 • Provenance – Prototype: • Submitters provide web services metadata for auto display • Submitters may provide an “About” page on Science Apps • Web services logs show number of users, number of hits – Goals for development: • Automatic provenance display on every submission • Monthly reporting to contributors (e.g. Google Analytics) • Standards compliance, e.g. W3C PROV spec • Community adoption
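Slide 55 notes that web service logs show the number of users and the number of hits. Assuming a log in which each entry records one call, the two figures could be derived as sketched below. The log format, the user names, and the function name `usage_stats` are invented for the example.

```python
def usage_stats(access_log):
    """Count total hits and distinct users in a list of log entries."""
    hits = len(access_log)
    unique_users = len({entry["user"] for entry in access_log})
    return {"hits": hits, "unique_users": unique_users}


# Made-up log entries: one dict per web service call.
log = [
    {"user": "alice", "service": "suba"},
    {"user": "bob", "service": "suba"},
    {"user": "alice", "service": "suba"},
]

print(usage_stats(log))
```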
  • 56. Web Design for Dynamic Pages. [Diagram: server and browser communicating over HTTP. Labels: JS, CSS, DB, HTML <form>, CGI, HTML <table>, URL, WebServices, JavaScript, HTML3, CSS3, HTML5.] Traditional: active server, static client. Submit one form, display one result. Server provides data and its format. Modern: active client, dynamic pages. Continual client/server interaction. Server provides data, client formats it.
  • 57. Requisite Architectural Diagram. [Diagram labels: External programs; Portal programs (www.araport.org); API (api.araport.org); Agave Core (keep metadata, enroll users); ADAMA (format data, enroll services); Authentication, metering, logging, versioning, security; Apps, Jobs, Systems; Computing, Storage, Databases; ThaleMine, JBrowse, Science Apps; InterMines, Tripal, Others; CGI, SOAP, REST.]
  • 58. Abstract • The Araport platform for scalable information exchange in genomics. • The Arabidopsis Information Portal (Araport) is a web resource for genome science. Araport is a new and free service centered on Arabidopsis thaliana, the plant whose genome sequence serves as a model for all of plant biology. Araport integrates data from major sources including NCBI, UniProt, PubMed, TAIR, BAR, EPIC CoGe, IntAct, Atted II, KEGG, and the 1001 Genomes Project. Araport also exposes its own “Araport11” update to the organism’s structural and functional gene annotation. Araport was conceived as a new kind of model organism database, one that could keep pace with ever-growing data sets while not burdening funding agencies with an ever-growing data warehouse. Araport is a platform for data sharing, data integration, and data federation. Araport provides means for scientists in the community to develop and deploy web services that expose data residing elsewhere on the internet. Araport provides means for scientists to develop and deploy “Science Apps” that can perform computational analysis and visualization of distributed data. Araport already hosts over 20 Science Apps and almost 100 web services linked to a dozen data sources. Currently shifting from prototype to development mode, Araport provides a model for sustainable growth of model organism community resources. Primarily an information science project, Araport takes on scalability challenges related to real-time integration of distributed services, interoperability between diverse services, indexing for federated search, reliability & responsiveness, security & logging, open-source development for software portability, and usability through automated documentation.
