SlideShare a Scribd company logo
1 of 21
A Method to Select
e-Infrastructure Components
to Sustain
Daniel S. Katz
dkatz@nsf.gov & d.katz@ieee.org
Program Director, Division of
Advanced Cyberinfrastructure,
National Science Foundation
&
David Proctor
djproctor@gmail.com
International Consortium of
Research Staff Associations
Reusable Infrastructure
• Systems and components created by one or
more people and intended to be used by others
• Over the last century, research infrastructure
(from microscopes to telescopes, and from
sequencers to colliders) has become essential
for many types of research
• Over the past few decades, most interfaces to
research infrastructure, and often the
infrastructure itself, have become digital
e-Infrastructure Defined
• e-Infrastructure, or cyberinfrastructure has
been defined by Craig Stewart as consisting of
“computing systems, data storage systems,
advanced instruments and data repositories,
visualization environments, and people, all
linked together by software and high
performance networks to improve research
productivity and enable breakthroughs not
otherwise possible”
• Can discuss both e-Infrastructure itself and
e-Infrastructure elements
e-Infrastructure Element Contexts
• Technical
– Architecture: How does it fit into the overall
infrastructure? How does it interact with other
infrastructure elements?
• Social
– Developers: Who has developed the element?
– Users: Who uses the element?
– Purpose: What is the intended use of the element?
• Political
– Funders: Who funds the development and
maintenance?
– Scope: Is the element national? International?
– Impact: How is the element valued by researchers
and funders
e-Infrastructure Element
• What resources are needed to create it,
and how can those resources be
assembled and applied?
• What resources are needed to sustain it,
and how can those resources be
assembled and applied?
e-Infrastructure Element
• What resources are needed to create it,
and how can those resources be
assembled and applied?
• What resources are needed to sustain it,
and how can those resources be
assembled and applied?
• Focus on 2nd half of questions here, since
amount and type of needed resources
vary widely by element
Infrastructure Challenges (1)
• Science
– Larger teams, more disciplines, more countries
• Data
– Size, complexity, rates all increasing rapidly
– Need for interoperability (systems and policies)
• Systems
– More cores, more architectures (GPUs), more memory
hierarchy
– Changing balances (latency versus bandwidth)
– Changing limits (power, funds)
– System architecture and business models changing
(clouds)
– Network capacity growing; increased networks ->
increased security
Infrastructure Challenges (2)
• Software
– Multiphysics algorithms, frameworks
– Programing models and abstractions for science,
data, and hardware
– Validation and verification, reproducibility
– Resilience & fault tolerance
• People
– Education and training
– Career paths
– Credit and attribution
e-Infrastructure Space
Temporal DurationSpatialExtent
Temporal Duration
Temporal DurationSpatialExtent
• Decisions that are made
about creating and
sustaining infrastructure
elements need to include
awareness of the expected
lifetime of the element.
• Rough estimates:
– Computer systems: 5 years
– Networks & instruments: 10
years
– Software: 20 years
– People: 40 years
– Data (& publications):
forever (or until they are no
longer useful, whichever
comes first)
Spatial Extent
Temporal DurationSpatialExtent
• In academia, could range:
– Lab
– Department
– College or school
– University
– University system or
regional alliance
– Nation
– Beyond
• General research institutions,
e.g. national labs, might have
similar type units with
different names, e.g. ‘Division’
rather than ‘Department’
Purpose
Temporal DurationSpatialExtent
• Ranges from:
– Used for one problem
• Maybe not infrastructure
– Used for variety of problems
in a discipline (e.g., climate
data from Arctic ice cores)
– Used for variety of problems
across many disciplines
(e.g., molecular dynamics
software)
– Used across all disciplines
(e.g., network, HPC system)
• Linked to temporal duration
– Lifetime of software element
may be 20 years, if the
element isn’t useful, its
lifetime will be shortened
Scale
Temporal DurationSpatialExtent
• Number of users (and
cost) should be larger
the farther the
element is from the
origin in any direction
• Generically called
‘scale’
• Scale is a metric of
the space, though not
orthogonal to any of
the three axes
Defining Sustainability
• [Environment Change]: Response
• [Dependent Infrastructure] When infrastructure on which the
element relies change, will element continue to provide the
same functionality?
• [Collaborative Infrastructure] As both collaborative elements
and the element changes, can the element still be
combined with other elements to meet user needs?
• [New Users] Are functionality and usability of the element
clearly explained to new users? Do users have a way to ask
questions and learn about the element?
• [Existing Users] Does the element provide the functionality
that current users want? As future needs develop, is the
element modular and adaptable so that it can meet them?
• [Science] As new science and theory develop, will the
element incorporate and implement them?
WSSPE2 Sustainability Discussion
Enablers
********** healthy and vibrant communities; vibrant community to champion
software
********** designing for growth and extension - open development
******* culture in community for reuse
**** portability
**** culture in developer community to support transition between
developers
*** interdisciplinary people: science + IT experience
** planning for end of life
** make smart choices about dependencies
* thinking of software as product lines - long term vs short term view
not all communities need new software
converting use into resources
WSSPE2 Sustainability Discussion
Barriers
******* lack of incentives, including promotion and tenure process; promotion
and tenure process in academic is incompatible with sustainability
***** absent or poor documentation
***** funding to ensure sustainability is difficult to obtain
**** developers are not computer scientists; don't have software
engineering practices (in particular, those needed to scale-up projects
to support and be developed by a large sustainable community)
*** overreliance on one or two people - bus test
** rate of change of underlying technologies
** lack of business models for sustainability
* lack of training for how to build sustainability into the system
maintenance needed for software is not visible, appears to ``just
happen’’
licensing issues
staff turnover - lack of continuity
Sustainability Models
• Open source
• Closed partnership
• For profit
• Dual licensing
• Open source and paid support
• Foundation or government
Governance
• Governance tells the community how the project makes
decisions and how they can be involved.
• Community = users and/or developers and/or advocates
...
• Important particularly when open source is involved, to a
lesser extent in the other models
• Examples of open source governance models
– Benevolent dictatorship, as in Linux kernel
– Meritocracy, as in Apache Foundation projects
• Can be considered top-down and bottom-up governance,
respectively
• Note: orthogonal to top-down (cathedral) and bottom-up
(bazaar) development
Which models work where?
• Research is needed to understand and correlate
success or failure of models with different
portions of infrastructure element space
• Some sample questions:
– Do the cathedral or bazaar governance model
correlate with successful projects along any or all
axes?
• For example, perhaps one works better at small scale,
and the other works better at large scale
– Do particular resource assembly and application
models cluster along any or all axes?
• Temporal duration: government funding at large values
of; mix of models at middle values; closed partnerships
at small values?
Conclusions
• Aim: encourage thought about how
e-Infrastructure elements and e-Infrastructure
itself should be considered
• In terms of how different elements may have
commonalities and differences across types of
element, user communities, etc.
• Want to begin a discussion about these issues
• Potential questions:
– Are the axes meaningful?
– What factors correlate with them?
– Are there any clusters?
• We are eager to receive feedback
Reference
Katz, D.S. and Proctor, D.
“A Framework for Discussing e-
Research Infrastructure Sustainability”
Journal of Open Research Software
2(1):e13, 2014.
DOI: 10.5334/jors.av

More Related Content

What's hot

Xsede for-nlhpc
Xsede for-nlhpcXsede for-nlhpc
Xsede for-nlhpcJohn Towns
 
Introduction to Systems Engineering
Introduction to Systems EngineeringIntroduction to Systems Engineering
Introduction to Systems EngineeringAnimesh Chaturvedi
 
Infrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAInfrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAResearch Data Alliance
 
Embracing Social Software And Semantic Web In Digital Libraries
Embracing Social Software And Semantic Web In Digital LibrariesEmbracing Social Software And Semantic Web In Digital Libraries
Embracing Social Software And Semantic Web In Digital LibrariesAkhmad Riza Faizal
 
Reality Mining
Reality MiningReality Mining
Reality MiningCI&T
 
Social Machines of Science and Scholarship
Social Machines of Science and ScholarshipSocial Machines of Science and Scholarship
Social Machines of Science and ScholarshipDavid De Roure
 
Infrastructure As Afterthought
Infrastructure As AfterthoughtInfrastructure As Afterthought
Infrastructure As AfterthoughtMaxKemman
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...The Higher Education Academy
 
A Semantics-based Approach to Machine Perception
A Semantics-based Approach to Machine PerceptionA Semantics-based Approach to Machine Perception
A Semantics-based Approach to Machine PerceptionCory Andrew Henson
 
ELIXIR . Technical Coordinator
ELIXIR. Technical CoordinatorELIXIR. Technical Coordinator
ELIXIR . Technical CoordinatorRafael C. Jimenez
 
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...Artificial Intelligence Institute at UofSC
 
What's up at Kno.e.sis?
What's up at Kno.e.sis? What's up at Kno.e.sis?
What's up at Kno.e.sis? Amit Sheth
 
Resume sima das
Resume sima dasResume sima das
Resume sima dasSima-Das
 
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Bridging Gaps and Broadening Participation inToday's and Future Research Com...
Bridging Gaps and Broadening Participation in Today's and Future Research Com...Sandra Gesing
 
SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010Jisc
 
Reality Mining (Nathan Eagle)
Reality Mining (Nathan Eagle)Reality Mining (Nathan Eagle)
Reality Mining (Nathan Eagle)Jan Sifra
 
BYU CS Colloquium Presentation
BYU CS Colloquium PresentationBYU CS Colloquium Presentation
BYU CS Colloquium PresentationDerek Hansen
 
The importance of FAIR and the Community of Data Driven Insights - the road t...
The importance of FAIR and the Community of Data Driven Insights - the road t...The importance of FAIR and the Community of Data Driven Insights - the road t...
The importance of FAIR and the Community of Data Driven Insights - the road t...Carlos Utrilla Guerrero
 

What's hot (20)

Xsede for-nlhpc
Xsede for-nlhpcXsede for-nlhpc
Xsede for-nlhpc
 
Introduction to Systems Engineering
Introduction to Systems EngineeringIntroduction to Systems Engineering
Introduction to Systems Engineering
 
Network Science: Theory, Modeling and Applications
Network Science: Theory, Modeling and ApplicationsNetwork Science: Theory, Modeling and Applications
Network Science: Theory, Modeling and Applications
 
Infrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDAInfrastructure, relationships, trust, and RDA
Infrastructure, relationships, trust, and RDA
 
Embracing Social Software And Semantic Web In Digital Libraries
Embracing Social Software And Semantic Web In Digital LibrariesEmbracing Social Software And Semantic Web In Digital Libraries
Embracing Social Software And Semantic Web In Digital Libraries
 
Reality Mining
Reality MiningReality Mining
Reality Mining
 
Sgci nsf-si2-2-21-17
Sgci nsf-si2-2-21-17Sgci nsf-si2-2-21-17
Sgci nsf-si2-2-21-17
 
Social Machines of Science and Scholarship
Social Machines of Science and ScholarshipSocial Machines of Science and Scholarship
Social Machines of Science and Scholarship
 
Infrastructure As Afterthought
Infrastructure As AfterthoughtInfrastructure As Afterthought
Infrastructure As Afterthought
 
Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...Making our mark: the important role of social scientists in the ‘era of big d...
Making our mark: the important role of social scientists in the ‘era of big d...
 
A Semantics-based Approach to Machine Perception
A Semantics-based Approach to Machine PerceptionA Semantics-based Approach to Machine Perception
A Semantics-based Approach to Machine Perception
 
ELIXIR . Technical Coordinator
ELIXIR. Technical CoordinatorELIXIR. Technical Coordinator
ELIXIR . Technical Coordinator
 
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
Data Processing and Semantics for Advanced Internet of Things (IoT) Applicati...
 
What's up at Kno.e.sis?
What's up at Kno.e.sis? What's up at Kno.e.sis?
What's up at Kno.e.sis?
 
Resume sima das
Resume sima dasResume sima das
Resume sima das
 
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Bridging Gaps and Broadening Participation inToday's and Future Research Com...
Bridging Gaps and Broadening Participation in Today's and Future Research Com...
 
SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010
 
Reality Mining (Nathan Eagle)
Reality Mining (Nathan Eagle)Reality Mining (Nathan Eagle)
Reality Mining (Nathan Eagle)
 
BYU CS Colloquium Presentation
BYU CS Colloquium PresentationBYU CS Colloquium Presentation
BYU CS Colloquium Presentation
 
The importance of FAIR and the Community of Data Driven Insights - the road t...
The importance of FAIR and the Community of Data Driven Insights - the road t...The importance of FAIR and the Community of Data Driven Insights - the road t...
The importance of FAIR and the Community of Data Driven Insights - the road t...
 

Similar to Select e-Infrastructure Components for Sustainability

Open Source and Science at the National Science Foundation (NSF)
Open Source and Science at the National Science Foundation (NSF)Open Source and Science at the National Science Foundation (NSF)
Open Source and Science at the National Science Foundation (NSF)Daniel S. Katz
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesDaniel S. Katz
 
Overview of XSEDE Systems Engineering
Overview of XSEDE Systems EngineeringOverview of XSEDE Systems Engineering
Overview of XSEDE Systems EngineeringJohn Towns
 
Asteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence SeriesAsteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence SeriesStormBourne, LLC
 
NISI Agile Software Architecture Slide Deck
NISI Agile Software Architecture Slide DeckNISI Agile Software Architecture Slide Deck
NISI Agile Software Architecture Slide DeckUtrecht University
 
XSEDE Overview (March 2014)
XSEDE Overview (March 2014)XSEDE Overview (March 2014)
XSEDE Overview (March 2014)John Towns
 
The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...tobold
 
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...Sebastian Dennerlein
 
ORION Workshop: XSEDE and Building a National/International Cyberinfrastructure
ORION Workshop: XSEDE and Building a National/International CyberinfrastructureORION Workshop: XSEDE and Building a National/International Cyberinfrastructure
ORION Workshop: XSEDE and Building a National/International CyberinfrastructureJohn Towns
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...SEAD
 
XSEDE and National Cyberinfrastructure
XSEDE and National CyberinfrastructureXSEDE and National Cyberinfrastructure
XSEDE and National CyberinfrastructureJohn Towns
 
Cultivating Sustainable Software For Research
Cultivating Sustainable Software For ResearchCultivating Sustainable Software For Research
Cultivating Sustainable Software For ResearchNeil Chue Hong
 
Software and Education at NSF/ACI
Software and Education at NSF/ACISoftware and Education at NSF/ACI
Software and Education at NSF/ACIDaniel S. Katz
 
cloudcomputingdistributedcomputing-171208050503 (1).pdf
cloudcomputingdistributedcomputing-171208050503 (1).pdfcloudcomputingdistributedcomputing-171208050503 (1).pdf
cloudcomputingdistributedcomputing-171208050503 (1).pdfArchanaPandiyan
 

Similar to Select e-Infrastructure Components for Sustainability (20)

Open Source and Science at the National Science Foundation (NSF)
Open Source and Science at the National Science Foundation (NSF)Open Source and Science at the National Science Foundation (NSF)
Open Source and Science at the National Science Foundation (NSF)
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community Responses
 
Overview of XSEDE Systems Engineering
Overview of XSEDE Systems EngineeringOverview of XSEDE Systems Engineering
Overview of XSEDE Systems Engineering
 
Asteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence SeriesAsteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence Series
 
Data-X-v3.1
Data-X-v3.1Data-X-v3.1
Data-X-v3.1
 
NISI Agile Software Architecture Slide Deck
NISI Agile Software Architecture Slide DeckNISI Agile Software Architecture Slide Deck
NISI Agile Software Architecture Slide Deck
 
XSEDE Overview (March 2014)
XSEDE Overview (March 2014)XSEDE Overview (March 2014)
XSEDE Overview (March 2014)
 
The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...The Social Semantic Server: A Flexible Framework to Support Informal Learning...
The Social Semantic Server: A Flexible Framework to Support Informal Learning...
 
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
The Social Semantic Server - A Flexible Framework to Support Informal Learnin...
 
Data-X-Sparse-v2
Data-X-Sparse-v2Data-X-Sparse-v2
Data-X-Sparse-v2
 
ORION Workshop: XSEDE and Building a National/International Cyberinfrastructure
ORION Workshop: XSEDE and Building a National/International CyberinfrastructureORION Workshop: XSEDE and Building a National/International Cyberinfrastructure
ORION Workshop: XSEDE and Building a National/International Cyberinfrastructure
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
 
XSEDE and National Cyberinfrastructure
XSEDE and National CyberinfrastructureXSEDE and National Cyberinfrastructure
XSEDE and National Cyberinfrastructure
 
Cultivating Sustainable Software For Research
Cultivating Sustainable Software For ResearchCultivating Sustainable Software For Research
Cultivating Sustainable Software For Research
 
Software and Education at NSF/ACI
Software and Education at NSF/ACISoftware and Education at NSF/ACI
Software and Education at NSF/ACI
 
A Methodology for Building the Internet of Things
A Methodology for Building the Internet of ThingsA Methodology for Building the Internet of Things
A Methodology for Building the Internet of Things
 
cloudcomputingdistributedcomputing-171208050503 (1).pdf
cloudcomputingdistributedcomputing-171208050503 (1).pdfcloudcomputingdistributedcomputing-171208050503 (1).pdf
cloudcomputingdistributedcomputing-171208050503 (1).pdf
 
Cloud Computing & Distributed Computing
Cloud Computing & Distributed ComputingCloud Computing & Distributed Computing
Cloud Computing & Distributed Computing
 
Inti escem-tours2012-acs
Inti escem-tours2012-acsInti escem-tours2012-acs
Inti escem-tours2012-acs
 
Grid computing
Grid computingGrid computing
Grid computing
 

More from Daniel S. Katz

Research software susainability
Research software susainabilityResearch software susainability
Research software susainabilityDaniel S. Katz
 
Software Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSASoftware Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSADaniel S. Katz
 
Parsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in PythonParsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in PythonDaniel S. Katz
 
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...Daniel S. Katz
 
Citation and Research Objects: Toward Active Research Objects
Citation and Research Objects: Toward Active Research ObjectsCitation and Research Objects: Toward Active Research Objects
Citation and Research Objects: Toward Active Research ObjectsDaniel S. Katz
 
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...Daniel S. Katz
 
Fundamentals of software sustainability
Fundamentals of software sustainabilityFundamentals of software sustainability
Fundamentals of software sustainabilityDaniel S. Katz
 
Software Citation in Theory and Practice
Software Citation in Theory and PracticeSoftware Citation in Theory and Practice
Software Citation in Theory and PracticeDaniel S. Katz
 
Research Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIResearch Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIDaniel S. Katz
 
Expressing and sharing workflows
Expressing and sharing workflowsExpressing and sharing workflows
Expressing and sharing workflowsDaniel S. Katz
 
Citation and reproducibility in software
Citation and reproducibility in softwareCitation and reproducibility in software
Citation and reproducibility in softwareDaniel S. Katz
 
Software Citation: Principles, Implementation, and Impact
Software Citation:  Principles, Implementation, and ImpactSoftware Citation:  Principles, Implementation, and Impact
Software Citation: Principles, Implementation, and ImpactDaniel S. Katz
 
Summary of WSSSPE and its working groups
Summary of WSSSPE and its working groupsSummary of WSSSPE and its working groups
Summary of WSSSPE and its working groupsDaniel S. Katz
 
Working towards Sustainable Software for Science: Practice and Experience (WS...
Working towards Sustainable Software for Science: Practice and Experience (WS...Working towards Sustainable Software for Science: Practice and Experience (WS...
Working towards Sustainable Software for Science: Practice and Experience (WS...Daniel S. Katz
 
20160607 citation4software panel
20160607 citation4software panel20160607 citation4software panel
20160607 citation4software panelDaniel S. Katz
 
20160607 citation4software opening
20160607 citation4software opening20160607 citation4software opening
20160607 citation4software openingDaniel S. Katz
 
What do we need beyond a DOI?
What do we need beyond a DOI?What do we need beyond a DOI?
What do we need beyond a DOI?Daniel S. Katz
 
Looking at Software Sustainability and Productivity Challenges from NSF
Looking at Software Sustainability and Productivity Challenges from NSFLooking at Software Sustainability and Productivity Challenges from NSF
Looking at Software Sustainability and Productivity Challenges from NSFDaniel S. Katz
 

More from Daniel S. Katz (20)

Research software susainability
Research software susainabilityResearch software susainability
Research software susainability
 
Software Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSASoftware Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSA
 
Parsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in PythonParsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in Python
 
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...
 
Citation and Research Objects: Toward Active Research Objects
Citation and Research Objects: Toward Active Research ObjectsCitation and Research Objects: Toward Active Research Objects
Citation and Research Objects: Toward Active Research Objects
 
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...
 
Fundamentals of software sustainability
Fundamentals of software sustainabilityFundamentals of software sustainability
Fundamentals of software sustainability
 
Software Citation in Theory and Practice
Software Citation in Theory and PracticeSoftware Citation in Theory and Practice
Software Citation in Theory and Practice
 
URSSI
URSSIURSSI
URSSI
 
Research Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIResearch Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSI
 
Software citation
Software citationSoftware citation
Software citation
 
Expressing and sharing workflows
Expressing and sharing workflowsExpressing and sharing workflows
Expressing and sharing workflows
 
Citation and reproducibility in software
Citation and reproducibility in softwareCitation and reproducibility in software
Citation and reproducibility in software
 
Software Citation: Principles, Implementation, and Impact
Software Citation:  Principles, Implementation, and ImpactSoftware Citation:  Principles, Implementation, and Impact
Software Citation: Principles, Implementation, and Impact
 
Summary of WSSSPE and its working groups
Summary of WSSSPE and its working groupsSummary of WSSSPE and its working groups
Summary of WSSSPE and its working groups
 
Working towards Sustainable Software for Science: Practice and Experience (WS...
Working towards Sustainable Software for Science: Practice and Experience (WS...Working towards Sustainable Software for Science: Practice and Experience (WS...
Working towards Sustainable Software for Science: Practice and Experience (WS...
 
20160607 citation4software panel
20160607 citation4software panel20160607 citation4software panel
20160607 citation4software panel
 
20160607 citation4software opening
20160607 citation4software opening20160607 citation4software opening
20160607 citation4software opening
 
What do we need beyond a DOI?
What do we need beyond a DOI?What do we need beyond a DOI?
What do we need beyond a DOI?
 
Looking at Software Sustainability and Productivity Challenges from NSF
Looking at Software Sustainability and Productivity Challenges from NSFLooking at Software Sustainability and Productivity Challenges from NSF
Looking at Software Sustainability and Productivity Challenges from NSF
 

Recently uploaded

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 

Recently uploaded (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 

Select e-Infrastructure Components for Sustainability

  • 1. A Method to Select e-Infrastructure Components to Sustain Daniel S. Katz dkatz@nsf.gov & d.katz@ieee.org Program Director, Division of Advanced Cyberinfrastructure, National Science Foundation & David Proctor djproctor@gmail.com International Consortium of Research Staff Associations
  • 2. Reusable Infrastructure • Systems and components created by one or more people and intended to be used by others • Over the last century, research infrastructure (from microscopes to telescopes, and from sequencers to colliders) has become essential for many types of research • Over the past few decades, most interfaces to research infrastructure, and often the infrastructure itself, have become digital
  • 3. e-Infrastructure Defined • e-Infrastructure, or cyberinfrastructure has been defined by Craig Stewart as consisting of “computing systems, data storage systems, advanced instruments and data repositories, visualization environments, and people, all linked together by software and high performance networks to improve research productivity and enable breakthroughs not otherwise possible” • Can discuss both e-Infrastructure itself and e-Infrastructure elements
  • 4. e-Infrastructure Element Contexts • Technical – Architecture: How does it fit into the overall infrastructure? How does it interact with other infrastructure elements? • Social – Developers: Who has developed the element? – Users: Who uses the element? – Purpose: What is the intended use of the element? • Political – Funders: Who funds the development and maintenance? – Scope: Is the element national? International? – Impact: How is the element valued by researchers and funders
  • 5. e-Infrastructure Element • What resources are needed to create it, and how can those resources be assembled and applied? • What resources are needed to sustain it, and how can those resources be assembled and applied?
  • 6. e-Infrastructure Element • What resources are needed to create it, and how can those resources be assembled and applied? • What resources are needed to sustain it, and how can those resources be assembled and applied? • Focus on 2nd half of questions here, since amount and type of needed resources vary widely by element
  • 7. Infrastructure Challenges (1) • Science – Larger teams, more disciplines, more countries • Data – Size, complexity, rates all increasing rapidly – Need for interoperability (systems and policies) • Systems – More cores, more architectures (GPUs), more memory hierarchy – Changing balances (latency versus bandwidth) – Changing limits (power, funds) – System architecture and business models changing (clouds) – Network capacity growing; increased networks -> increased security
  • 8. Infrastructure Challenges (2) • Software – Multiphysics algorithms, frameworks – Programing models and abstractions for science, data, and hardware – Validation and verification, reproducibility – Resilience & fault tolerance • People – Education and training – Career paths – Credit and attribution
  • 10. Temporal Duration Temporal DurationSpatialExtent • Decisions that are made about creating and sustaining infrastructure elements need to include awareness of the expected lifetime of the element. • Rough estimates: – Computer systems: 5 years – Networks & instruments: 10 years – Software: 20 years – People: 40 years – Data (& publications): forever (or until they are no longer useful, whichever comes first)
  • 11. Spatial Extent Temporal DurationSpatialExtent • In academia, could range: – Lab – Department – College or school – University – University system or regional alliance – Nation – Beyond • General research institutions, e.g. national labs, might have similar type units with different names, e.g. ‘Division’ rather than ‘Department’
  • 12. Purpose Temporal DurationSpatialExtent • Ranges from: – Used for one problem • Maybe not infrastructure – Used for variety of problems in a discipline (e.g., climate data from Arctic ice cores) – Used for variety of problems across many disciplines (e.g., molecular dynamics software) – Used across all disciplines (e.g., network, HPC system) • Linked to temporal duration – Lifetime of software element may be 20 years, if the element isn’t useful, its lifetime will be shortened
  • 13. Scale Temporal DurationSpatialExtent • Number of users (and cost) should be larger the farther the element is from the origin in any direction • Generically called ‘scale’ • Scale is a metric of the space, though not orthogonal to any of the three axes
  • 14. Defining Sustainability • [Environment Change]: Response • [Dependent Infrastructure] When infrastructure on which the element relies change, will element continue to provide the same functionality? • [Collaborative Infrastructure] As both collaborative elements and the element changes, can the element still be combined with other elements to meet user needs? • [New Users] Are functionality and usability of the element clearly explained to new users? Do users have a way to ask questions and learn about the element? • [Existing Users] Does the element provide the functionality that current users want? As future needs develop, is the element modular and adaptable so that it can meet them? • [Science] As new science and theory develop, will the element incorporate and implement them?
  • 15. WSSPE2 Sustainability Discussion Enablers ********** healthy and vibrant communities; vibrant community to champion software ********** designing for growth and extension - open development ******* culture in community for reuse **** portability **** culture in developer community to support transition between developers *** interdisciplinary people: science + IT experience ** planning for end of life ** make smart choices about dependencies * thinking of software as product lines - long term vs short term view not all communities need new software converting use into resources
  • 16. WSSPE2 Sustainability Discussion Barriers ******* lack of incentives, including promotion and tenure process; promotion and tenure process in academic is incompatible with sustainability ***** absent or poor documentation ***** funding to ensure sustainability is difficult to obtain **** developers are not computer scientists; don't have software engineering practices (in particular, those needed to scale-up projects to support and be developed by a large sustainable community) *** overreliance on one or two people - bus test ** rate of change of underlying technologies ** lack of business models for sustainability * lack of training for how to build sustainability into the system maintenance needed for software is not visible, appears to ``just happen’’ licensing issues staff turnover - lack of continuity
  • 17. Sustainability Models • Open source • Closed partnership • For profit • Dual licensing • Open source and paid support • Foundation or government
  • 18. Governance • Governance tells the community how the project makes decisions and how they can be involved. • Community = users and/or developers and/or advocates ... • Important particularly when open source is involved, to a lesser extent in the other models • Examples of open source governance models – Benevolent dictatorship, as in Linux kernel – Meritocracy, as in Apache Foundation projects • Can be considered top-down and bottom-up governance, respectively • Note: orthogonal to top-down (cathedral) and bottom-up (bazaar) development
  • 19. Which models work where? • Research is needed to understand and correlate success or failure of models with different portions of infrastructure element space • Some sample questions: – Do the cathedral or bazaar governance model correlate with successful projects along any or all axes? • For example, perhaps one works better at small scale, and the other works better at large scale – Do particular resource assembly and application models cluster along any or all axes? • Temporal duration: government funding at large values of; mix of models at middle values; closed partnerships at small values?
  • 20. Conclusions • Aim: encourage thought about how e-Infrastructure elements and e-Infrastructure itself should be considered • In terms of how different elements may have commonalities and differences across types of element, user communities, etc. • Want to begin a discussion about these issues • Potential questions: – Are the axes meaningful? – What factors correlate with them? – Are there any clusters? • We are eager to receive feedback
  • 21. Reference Katz, D.S. and Proctor, D. “A Framework for Discussing e- Research Infrastructure Sustainability” Journal of Open Research Software 2(1):e13, 2014. DOI: 10.5334/jors.av

Editor's Notes

  1. Open source: a leader (or a set of leaders) promotes a goal of creating an infrastructure element in a public manner and a community voluntarily forms to work together on this goal. Closed partnership: a set of partners works together to create an infrastructure element, but the partnership is not open to external contributions. For profit: a group creates an infrastructure element using its own resources with the goal of later selling, leasing, or licensing the element or its design to recover the expended resource and make a profit. Dual licensing: a group creates an infrastructure element using its own resources with the goal of allowing academic free use (and depending on the license, perhaps gaining further free contributions from that academic community), while also selling, leasing, or licensing the element or its design to industry in order to recover the expended resource and perhaps make a profit, or at least, break even. This model also often has an implicit goal of not allowing other companies to financially profit directly from the element. Open source and paid support: a group supports an open source element in exchange for resources from the users of that support. The support can include helping the users with the existing element, and adding features to the element for the supported users, though these added features become available to all users, not just those who have paid for support. Foundation or government: one or more groups convince an organization that promotes public advancement that creating an infrastructure element will be a public good that should be supported.