SlideShare a Scribd company logo
1 of 29
Download to read offline
Developing
institutional RDM
services
Michael Day
Digital Curation Centre (DCC)
UKOLN, University of Bath
DCC Workshop, Cardiff University 14 May 2013
Session outline
 Managing active data
 Storage options
 Long-term retention of data
 Selection criteria
 Data repositories
 Finding and citing data
 Data registries and metadata
 Presentation based on: Sarah Jones, Graham Pryor and Angus Whyte, How to
Develop Research Data Management Services – a guide for HEIs (DCC, 2013):
 http://www.dcc.ac.uk/resources/how-guides/how-develop-rdm-services
 Some slides reused from RDMRose training materials:
 http://rdmrose.group.shef.ac.uk/
Managing active
data
Managing active data: key tasks
 Researchers:
 Have a duty to ensure that research data is stored securely and backed-up on a
regular basis
 Have choices (e.g. network drives, laptops, external storage devices, online /
cloud-based storage)
 Need to take data security seriously
 This should be considered as part of the data management planning process
 Institutions:
 Need to be constantly review data holdings and RDM practices in order to
evaluate whether current storage infrastructures are sufficient
 May need to make a case for investing in the provision of additional data storage
capability
 Need procedures for the allocation and management of storage
 Need to be flexible, taking account of a diverse range of research contexts and
data storage requirements
Research data storage
 Trend for some HEIs to enhance the capacity of
research data storage facilities
 Extending capacity of existing filestores (e.g. Bath)
 Exploring secure cloud storage
 Utilising High Performance Computing facilities
 Managing storage
 University of Bristol (data.bris) – registered researchers (data
stewards) are allocated 5TB storage to manage, e.g. deciding
how long data should be kept, who has access, etc.
 http://data.blogs.ilrt.org
Options for managing active data
 Cloud storage options
 There may be benefits in terms of costs and expertise
 There may also be risks (e.g. loss of control, jurisdictional
issues)
 Janet Brokerage - promoting the use of cloud and off-site data
centre facilities
 Academic dropbox-like services
 Dropbox is often used for sharing and synching data between
machines, but institutions are keen to retain control
 Systems developed in-house
 Typically developed with an disciplinary focus, e.g. BRISSkit
(biomedicine)
Selection for the long-
term retention of data
Selecting data for retention
 RCUK, Common Principles on Data Policy (2011):
 “Data with acknowledged long-term value should be preserved and remain
accessible and usable for future research”
 http://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx
 Institutions will need to establish clear criteria to guide decisions on what
should be kept
 It will not be possible to retain everything
 Carefully considered selection processes are essential to help prioritise that data
that has long-term value
 Institutional selection processes will need to take account of:
 Data that institutions are legally obliged to retain (or destroy), e.g. for contractual
or regulatory reasons
 Different disciplinary practices (e.g., some disciplines will have mature data
sharing infrastructures and will already deposit data with third party services)
 Researcher sensitivities about losing control of data (deposit agreements)
Developing guidance on selection
 Establishing guidelines, processes and good
practice for data selection and deposit can be
one of the more challenging aspects of an RDM
service
 There is a need for buy-in from researchers
 There is a need for clarity on what kinds of data are
within the remit of an institutional RDM service
 There may be a need to apply different levels of
curation, e.g. depending on the perceived value of the
data accepted
DCC selection categories
 DCC How to Select and Appraise Research Data for
Curation (Whyte and Wilson, 2010) proposes seven
main criteria:
 Relevance to mission
 Scientific or historic value
 Uniqueness
 Potential for redistribution
 Non-replicability
 Economic case
 Full documentation
 http://www.dcc.ac.uk/resources/how-guides/appraise-
select-data
Data repositories
Data repositories
 Focusing on how data will be preserved and
made available for others
 Main options:
 Developing an institutional data repository
 Building, where possible, on existing systems, e.g. IR, CRIS,
etc.
 Essex Research Data demo: http://researchdata.essex.ac.uk/
 Liaising with external research data repositories (or data
centres)
 Often subject based, some UK data centres supported by
funding bodies
 Providing researchers with information on external services
Data catalogues
RCUK Common Principles
 RCUK, Common Principles on Data Policy (2011):
 “To enable research data to be discoverable and
effectively re-used by others, sufficient metadata should
be recorded and made openly available to enable other
researchers to understand the research and re-use
potential of the data. Published results should always
include information on how to access the supporting
data”
 Also EPSRC Principle 6
 http://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx
EPSRC Expectation V
 “Research organisations will ensure that appropriately
structured metadata describing the research data they
hold is published (normally within 12 months of the data
being generated) and made freely accessible on the
internet; in each case the metadata must be sufficient to
allow others to understand what research data exists,
why, when and how it was generated, and how to
access it. Where the research data referred to in the
metadata is a digital object it is expected that the
metadata will include use of a robust digital object
identifier (For example as available through the
DataCite organisation - http://datacite.org).”
 http://www.epsrc.ac.uk/about/standards/researchdata/P
ages/expectations.aspx
May-13
Learning material produced by RDMRose
http://www.sheffield.ac.uk/is/research/projects/rdmros
e
Some questions to consider
 What metadata is required to adequately record
datasets? What is “sufficient metadata” for discovery and
re-use?
 Does any of this metadata already exist?
 If so, where might it be found?
 If not, how can the appropriate metadata be generated or
captured?
 Will there be a need to share this metadata, e.g. with
third-party discovery services? National data services?
 If so, what standards exist to support metadata sharing?
Examples: UKOLN Scoping Study
 Scientific Data Application Profile Scoping Study (UKOLN, 2009)
 Building on work undertaken on the Scholarly Works Application Profile
(SWAP)
 Analysed the metadata used by UK data centres and repositories,
selected domain models (e.g. DDI, CCLRC Metadata Model, CIDOC
CRM)
 Concluded that:
 Simple Dublin Core (e.g., as mandated by OAI-PMH) would be insufficient
 There was sufficient convergence between the different schemas to suggest
that a generic metadata profile could be constructed
 A generic metadata profile would benefit interdisciplinary research and
institution based services (e.g. IRs)
 http://www.ukoln.ac.uk/projects/sdapss/
Examples: DataCite metadata (1)
 DataCite:
Organisation aiming to facilitate easier access
to (and citation of) research data, e.g. through
the use of persistent identifiers (DOIs)
DataCite Metadata Schema (currently v. 2.2,
2011) defines core metadata properties
Broadly based on Dublin Core concepts
http://schema.datacite.org
Examples: DataCite metadata (2)
 Mandatory Properties:
 Identifier
 Creator
 Title
 Publisher
 PublicationYear
 Administrative Metadata
 LastMetadataUpdate
 MetadataVersionNumber
 Optional Properties:
 Subject
 Contributor
 Date
 Language
 ResourceType
 AlternateIdentifier
 RelatedIdentifier
 Size
 Format
 Version
 Rights
 Description
Examples: University of Oxford
 The DaMaRO project at the University of Oxford is developing
a metadata schema for its DataFinder (Rumsey, 2012).
 A three-tier metadata approach:
 Mandatory minimal metadata to enable basic discovery, such as
Creator, Title, Publisher, Date, Location, Access terms &
conditions
 Mandatory contextual metadata (mostly administrative and
partly based on EPSRC expectations), such as Funding Agency,
Grant Number, Last access request date, Project Information,
Data Generation Process, Why the data was generated, Date
(range) of data collection, Reasons for embargoes
 Optional metadata (including discipline-specific metadata) to
enable reuse, such as machine settings and the experimental
conditions under which the data were gathered
May-13
Learning material produced by RDMRose
http://www.sheffield.ac.uk/is/research/projects/rdmros
e
Examples: University of Essex
 RDE Metadata Profile for EPrints
 Based on DataCite, INSPIRE, DDI 2.1 and DataShare
 Mixture of generic schema and standards specific to
social science data
 http://data-
archive.ac.uk/media/375386/rde_eprints_metadatapr
ofile.pdf
 Seems to be convergence on layered approach
Some practical questions (1)
 Technical choices for institutions:
 Developing new institutional services, e.g. the
approach taken by ANDS:
http://www.ands.org.au/guides/metadata-stores-
solutions.html
 Defining metadata stores by their coverage, the granularity of
data that they describe, and the specialisation of their
descriptions (e.g. collection-level, object level, local,
institutional, national and discipline-specific)
 Building upon existing infrastructures, e.g.:
 Institutional Repositories
 CRIS (e.g. Pure, Symplectic, Converis)
Some practical questions (2)
 Research Information Management interaction?
 There is interest in what RIM standards like CERIF can offer RDM (e.g.
potentially richer metadata structures for linking research outputs with
organisational groupings and funding streams, some level of buy-in from
funding bodies), but implementation
 CERIF for Datasets (C4D): http://cerif4datasets.wordpress.com
 We need to think about how metadata can be shared with:
 Discipline-based repositories and data centres
 Emerging national (and international) discovery infrastructures
 Australian National Data Service
 Uses RIF-CS schema (based on ISO 2146:2010) as a data interchange format
 Jisc and DCC are currently exploring the options for collating metadata
about research data at national level
Data citation
Data Citation
 Issues include (Ball & Duke, 2011a and b):
 At what granularity should data be made citeable?
 How to credit each contributor in a dataset that is
assembled from very many contributions?
 Where in a research paper should a data citation be
given (e.g. a paper describing a dataset versus
subsequent papers using it)?
 What to do with frequently updated data?
May-13
Learning material produced by RDMRose
http://www.sheffield.ac.uk/is/research/projects/rdmros
e
DataCite
 DataCite (http://www.datacite.org) is a not-for-profit
organisation that aims to promote and support the
sharing of research data
 They are developing an infrastructure that supports
methods of data citation, discovery, and access
 They are currently leveraging the DOI (Digital Object
Identifier) infrastructure, which is also used for research
articles
 They can provide DOIs for datasets
 DataCite DOIs have to resolve to a public landing page
with information about the dataset and a direct link to it
May-13
Learning material produced by RDMRose
http://www.sheffield.ac.uk/is/research/projects/rdmros
e
DataCite
 Basic form:
 Creator (PublicationYear): Title. Publisher. Identifier
 Version and ResourceType are optional extra elements
 For citation purposes, DataCite recommends that DOI
names are displayed as linkable, permanent URLs
 More info in DataCite (2011)
 University of Poppleton (2011): Precipitation
measurements 1905-2010 taken at Western Bank
weather station. Meteorological service, The University
of Poppleton. http://dx.doi.org/10.1594/UoP.MS.298
May-13
Learning material produced by RDMRose
http://www.sheffield.ac.uk/is/research/projects/rdmros
e
References
 Ball, A., (2009). Scientific Data Application Profile Scoping Study Report. Bath:
UKOLN, University of Bath. Retrieved from: http://www.ukoln.ac.uk/projects/sdapss/
 Ball, A., & Duke, M. (2011a). Data Citation and Linking. DCC Briefing Papers.
Edinburgh: Digital Curation Centre. Retrieved from
http://www.dcc.ac.uk/resources/briefing-papers/introduction-curation/data-citation-
and-linking
 Ball, A., & Duke, M. (2011b). How to Cite Datasets and Link to Publications. DCC
How-To Guides. Edinburgh: Digital Curation Centre. Retrieved from
http://www.dcc.ac.uk/resources/how-guides/cite-datasets
 DataCite (2011). DataCite Metadata Schema for the Publication and Citation of
Research Data. Version 2.2. London: DataCite. Retrieved from
http://schema.datacite.org/meta/kernel-2.2/doc/DataCite-MetadataKernel_v2.2.pdf.
doi:10.5438/0005
 Rumsey, S. (2012). Just enough metadata: Metadata for research datasets in
institutional data repositories [PowerPoint presentation]. Oxford: The University of
Oxford. Retrieved from
http://damaro.oucs.ox.ac.uk/docs/Just%20enough%20metadata%20v3-1.pdf
May-13
Learning material produced by RDMRose
http://www.sheffield.ac.uk/is/research/projects/rdmros
e
Questions?

More Related Content

What's hot

Going for GOLD - Adventures in Open Linked Geospatial Metadata
Going for GOLD - Adventures in Open Linked Geospatial MetadataGoing for GOLD - Adventures in Open Linked Geospatial Metadata
Going for GOLD - Adventures in Open Linked Geospatial MetadataEDINA, University of Edinburgh
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareHistoric Environment Scotland
 
Developing Research Data Management Policy and Services
Developing Research Data Management Policy and ServicesDeveloping Research Data Management Policy and Services
Developing Research Data Management Policy and ServicesRobin Rice
 
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience EDINA, University of Edinburgh
 
Introduction to data support services and resources for public policy
Introduction to data support services and resources for public policyIntroduction to data support services and resources for public policy
Introduction to data support services and resources for public policyHistoric Environment Scotland
 
‘Good, better, best’? Examining the range and rationales of institutional dat...
‘Good, better, best’? Examining the range and rationales of institutional dat...‘Good, better, best’? Examining the range and rationales of institutional dat...
‘Good, better, best’? Examining the range and rationales of institutional dat...Robin Rice
 
SDI – National to Global: perspectives from the UK academic sector
SDI – National to Global: perspectives from the UK academic sector SDI – National to Global: perspectives from the UK academic sector
SDI – National to Global: perspectives from the UK academic sector EDINA, University of Edinburgh
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationHistoric Environment Scotland
 
Co-operation for digital preservation and curation: collaboration for collect...
Co-operation for digital preservation and curation: collaboration for collect...Co-operation for digital preservation and curation: collaboration for collect...
Co-operation for digital preservation and curation: collaboration for collect...Michael Day
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareRobin Rice
 
Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...EDINA, University of Edinburgh
 
Jisc unleashing data 5 minutes
Jisc unleashing data 5 minutesJisc unleashing data 5 minutes
Jisc unleashing data 5 minutesDaniela G. Duca
 
Research Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture ChangeResearch Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture ChangeHistoric Environment Scotland
 

What's hot (20)

Going for GOLD - Adventures in Open Linked Geospatial Metadata
Going for GOLD - Adventures in Open Linked Geospatial MetadataGoing for GOLD - Adventures in Open Linked Geospatial Metadata
Going for GOLD - Adventures in Open Linked Geospatial Metadata
 
Authentication Methods: Shibboleth
Authentication Methods: ShibbolethAuthentication Methods: Shibboleth
Authentication Methods: Shibboleth
 
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShareResearch Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
Research Data Services @ Edinburgh: MANTRA & Edinburgh DataShare
 
National Activities and the UK LOCKSS Alliance
National Activities and the UK LOCKSS AllianceNational Activities and the UK LOCKSS Alliance
National Activities and the UK LOCKSS Alliance
 
Developing Research Data Management Policy and Services
Developing Research Data Management Policy and ServicesDeveloping Research Data Management Policy and Services
Developing Research Data Management Policy and Services
 
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
Shibboleth Access Management Federations and Secure SDI: ESDIN Experience
 
Introduction to data support services and resources for public policy
Introduction to data support services and resources for public policyIntroduction to data support services and resources for public policy
Introduction to data support services and resources for public policy
 
‘Good, better, best’? Examining the range and rationales of institutional dat...
‘Good, better, best’? Examining the range and rationales of institutional dat...‘Good, better, best’? Examining the range and rationales of institutional dat...
‘Good, better, best’? Examining the range and rationales of institutional dat...
 
SDI – National to Global: perspectives from the UK academic sector
SDI – National to Global: perspectives from the UK academic sector SDI – National to Global: perspectives from the UK academic sector
SDI – National to Global: perspectives from the UK academic sector
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant Application
 
Co-operation for digital preservation and curation: collaboration for collect...
Co-operation for digital preservation and curation: collaboration for collect...Co-operation for digital preservation and curation: collaboration for collect...
Co-operation for digital preservation and curation: collaboration for collect...
 
RDM @ UoE
RDM @ UoERDM @ UoE
RDM @ UoE
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
 
OGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and AuthenticationOGC Interoperability Experiments and Authentication
OGC Interoperability Experiments and Authentication
 
Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...
 
Delivering Postgraduate Training - MANTRA
Delivering Postgraduate Training - MANTRADelivering Postgraduate Training - MANTRA
Delivering Postgraduate Training - MANTRA
 
Jisc unleashing data 5 minutes
Jisc unleashing data 5 minutesJisc unleashing data 5 minutes
Jisc unleashing data 5 minutes
 
Who is doing what, and how do we know? [PEPRS]
Who is doing what, and how do we know? [PEPRS]Who is doing what, and how do we know? [PEPRS]
Who is doing what, and how do we know? [PEPRS]
 
Research Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture ChangeResearch Data Management at Edinburgh: Effecting Culture Change
Research Data Management at Edinburgh: Effecting Culture Change
 
Open Spatial Data: Sources and Tools
Open Spatial Data: Sources and ToolsOpen Spatial Data: Sources and Tools
Open Spatial Data: Sources and Tools
 

Viewers also liked

What can libraries do for researchers?
What can libraries do for researchers?What can libraries do for researchers?
What can libraries do for researchers?Michael Day
 
Implementing digital preservation strategy: collection profiling at the Briti...
Implementing digital preservation strategy: collection profiling at the Briti...Implementing digital preservation strategy: collection profiling at the Briti...
Implementing digital preservation strategy: collection profiling at the Briti...Michael Day
 
DCMI Education Linked Data Session, DC-2009 Conference, Seoul Korea
DCMI Education Linked Data Session, DC-2009 Conference, Seoul KoreaDCMI Education Linked Data Session, DC-2009 Conference, Seoul Korea
DCMI Education Linked Data Session, DC-2009 Conference, Seoul KoreaSarah Currier
 
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)Marcia Zeng
 
Disciplinary and institutional perspectives on digital curation
Disciplinary and institutional perspectives on digital curationDisciplinary and institutional perspectives on digital curation
Disciplinary and institutional perspectives on digital curationMichael Day
 
DCMI Education Community Brief Update
DCMI Education Community Brief UpdateDCMI Education Community Brief Update
DCMI Education Community Brief UpdateSarah Currier
 
DCC 101: Preservation
DCC 101: PreservationDCC 101: Preservation
DCC 101: PreservationMichael Day
 
Data curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation CentreData curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation CentreMichael Day
 
Application Profiles for Subject Domains
Application Profiles for Subject DomainsApplication Profiles for Subject Domains
Application Profiles for Subject DomainsMarcia Zeng
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curationMichael Day
 

Viewers also liked (10)

What can libraries do for researchers?
What can libraries do for researchers?What can libraries do for researchers?
What can libraries do for researchers?
 
Implementing digital preservation strategy: collection profiling at the Briti...
Implementing digital preservation strategy: collection profiling at the Briti...Implementing digital preservation strategy: collection profiling at the Briti...
Implementing digital preservation strategy: collection profiling at the Briti...
 
DCMI Education Linked Data Session, DC-2009 Conference, Seoul Korea
DCMI Education Linked Data Session, DC-2009 Conference, Seoul KoreaDCMI Education Linked Data Session, DC-2009 Conference, Seoul Korea
DCMI Education Linked Data Session, DC-2009 Conference, Seoul Korea
 
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)
A Metadata Application Profile for KOS Vocabulary Registries (KOS-AP)
 
Disciplinary and institutional perspectives on digital curation
Disciplinary and institutional perspectives on digital curationDisciplinary and institutional perspectives on digital curation
Disciplinary and institutional perspectives on digital curation
 
DCMI Education Community Brief Update
DCMI Education Community Brief UpdateDCMI Education Community Brief Update
DCMI Education Community Brief Update
 
DCC 101: Preservation
DCC 101: PreservationDCC 101: Preservation
DCC 101: Preservation
 
Data curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation CentreData curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation Centre
 
Application Profiles for Subject Domains
Application Profiles for Subject DomainsApplication Profiles for Subject Domains
Application Profiles for Subject Domains
 
Introduction to digital curation
Introduction to digital curationIntroduction to digital curation
Introduction to digital curation
 

Similar to Developing Institutional Research Data Management Services

Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Sarah Jones
 
RDM for Librarians
RDM for LibrariansRDM for Librarians
RDM for LibrariansMarieke Guy
 
UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceLizLyon
 
Disciplinary RDM
Disciplinary RDMDisciplinary RDM
Disciplinary RDMSarah Jones
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataMartin Hamilton
 
Digital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigitalPreservationEurope
 
RDMRose 1.4 The research data lifecycle
RDMRose 1.4 The research data lifecycleRDMRose 1.4 The research data lifecycle
RDMRose 1.4 The research data lifecycleRDMRose
 
RDM LIASA webinar
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinarSarah Jones
 
Sarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspectiveSarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspectiveJisc
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...heila1
 
Making research data more resourceful - Jisc digital festival 2015
Making research data more resourceful - Jisc digital festival 2015Making research data more resourceful - Jisc digital festival 2015
Making research data more resourceful - Jisc digital festival 2015Jisc
 

Similar to Developing Institutional Research Data Management Services (20)

Dc101 oxford sj_16062010
Dc101 oxford sj_16062010Dc101 oxford sj_16062010
Dc101 oxford sj_16062010
 
RDM for Librarians
RDM for LibrariansRDM for Librarians
RDM for Librarians
 
DAF methodology
DAF methodologyDAF methodology
DAF methodology
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Looking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ EdinburghLooking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ Edinburgh
 
Resources for Research Data Managers - 2014-05-28 - University of Oxford
Resources for Research Data Managers - 2014-05-28 - University of OxfordResources for Research Data Managers - 2014-05-28 - University of Oxford
Resources for Research Data Managers - 2014-05-28 - University of Oxford
 
UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalface
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
Disciplinary RDM
Disciplinary RDMDisciplinary RDM
Disciplinary RDM
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research Data
 
Digital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and RequirementsDigital Preservation Process: Preparation and Requirements
Digital Preservation Process: Preparation and Requirements
 
RDMRose 1.4 The research data lifecycle
RDMRose 1.4 The research data lifecycleRDMRose 1.4 The research data lifecycle
RDMRose 1.4 The research data lifecycle
 
RDM LIASA webinar
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinar
 
Sarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspectiveSarah Jones RDM from a disciplinary perspective
Sarah Jones RDM from a disciplinary perspective
 
Johnston - How to Curate Research Data
Johnston - How to Curate Research DataJohnston - How to Curate Research Data
Johnston - How to Curate Research Data
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...
 
Making research data more resourceful - Jisc digital festival 2015
Making research data more resourceful - Jisc digital festival 2015Making research data more resourceful - Jisc digital festival 2015
Making research data more resourceful - Jisc digital festival 2015
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 

More from Michael Day

Digital Curation 101 (University of Glamorgan)
Digital Curation 101 (University of Glamorgan)Digital Curation 101 (University of Glamorgan)
Digital Curation 101 (University of Glamorgan)Michael Day
 
Continuity and change: Opportunities and challenges for the future of researc...
Continuity and change: Opportunities and challenges for the future of researc...Continuity and change: Opportunities and challenges for the future of researc...
Continuity and change: Opportunities and challenges for the future of researc...Michael Day
 
Developing a Community Capability Model Framework for data-intensive research
Developing a Community Capability Model Framework for data-intensive researchDeveloping a Community Capability Model Framework for data-intensive research
Developing a Community Capability Model Framework for data-intensive researchMichael Day
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data managementMichael Day
 
Introduction to Research Data Management: activities, roles and requirements
Introduction to Research Data Management: activities, roles and requirementsIntroduction to Research Data Management: activities, roles and requirements
Introduction to Research Data Management: activities, roles and requirementsMichael Day
 
Digital Preservation
Digital PreservationDigital Preservation
Digital PreservationMichael Day
 
UKOLN activities on research information management
UKOLN activities on research information managementUKOLN activities on research information management
UKOLN activities on research information managementMichael Day
 
UKOLN Programme Support for the JISC Research Information Management Programme
UKOLN Programme Support for the JISC Research Information Management ProgrammeUKOLN Programme Support for the JISC Research Information Management Programme
UKOLN Programme Support for the JISC Research Information Management ProgrammeMichael Day
 
Digital Preservation
Digital PreservationDigital Preservation
Digital PreservationMichael Day
 
Models for integrating institutional repositories and research information ma...
Models for integrating institutional repositories and research information ma...Models for integrating institutional repositories and research information ma...
Models for integrating institutional repositories and research information ma...Michael Day
 
Research Information Management
Research Information ManagementResearch Information Management
Research Information ManagementMichael Day
 
Digital preservation exercises
Digital preservation exercisesDigital preservation exercises
Digital preservation exercisesMichael Day
 
Brief Introduction to Digital Preservation
Brief Introduction to Digital PreservationBrief Introduction to Digital Preservation
Brief Introduction to Digital PreservationMichael Day
 
Curation of Research Data
Curation of Research DataCuration of Research Data
Curation of Research DataMichael Day
 
Digital preservation from a records management perspective
Digital preservation from a records management perspectiveDigital preservation from a records management perspective
Digital preservation from a records management perspectiveMichael Day
 
The Improving Access to Text (IMPACT) project and other European initiatives
The Improving Access to Text (IMPACT) project and other European initiativesThe Improving Access to Text (IMPACT) project and other European initiatives
The Improving Access to Text (IMPACT) project and other European initiativesMichael Day
 
Repositories and digital preservation
Repositories and digital preservationRepositories and digital preservation
Repositories and digital preservationMichael Day
 
Enhancing social tagging with a knowledge organization system
Enhancing social tagging with a knowledge organization systemEnhancing social tagging with a knowledge organization system
Enhancing social tagging with a knowledge organization systemMichael Day
 
Digital Curation 101: Preserve
Digital Curation 101: PreserveDigital Curation 101: Preserve
Digital Curation 101: PreserveMichael Day
 

More from Michael Day (20)

Digital Curation 101 (University of Glamorgan)
Digital Curation 101 (University of Glamorgan)Digital Curation 101 (University of Glamorgan)
Digital Curation 101 (University of Glamorgan)
 
Continuity and change: Opportunities and challenges for the future of researc...
Continuity and change: Opportunities and challenges for the future of researc...Continuity and change: Opportunities and challenges for the future of researc...
Continuity and change: Opportunities and challenges for the future of researc...
 
Developing a Community Capability Model Framework for data-intensive research
Developing a Community Capability Model Framework for data-intensive researchDeveloping a Community Capability Model Framework for data-intensive research
Developing a Community Capability Model Framework for data-intensive research
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
Introduction to Research Data Management: activities, roles and requirements
Introduction to Research Data Management: activities, roles and requirementsIntroduction to Research Data Management: activities, roles and requirements
Introduction to Research Data Management: activities, roles and requirements
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
UKOLN activities on research information management
UKOLN activities on research information managementUKOLN activities on research information management
UKOLN activities on research information management
 
UKOLN Programme Support for the JISC Research Information Management Programme
UKOLN Programme Support for the JISC Research Information Management ProgrammeUKOLN Programme Support for the JISC Research Information Management Programme
UKOLN Programme Support for the JISC Research Information Management Programme
 
Digital Preservation
Digital PreservationDigital Preservation
Digital Preservation
 
EASTER project
EASTER projectEASTER project
EASTER project
 
Models for integrating institutional repositories and research information ma...
Models for integrating institutional repositories and research information ma...Models for integrating institutional repositories and research information ma...
Models for integrating institutional repositories and research information ma...
 
Research Information Management
Research Information ManagementResearch Information Management
Research Information Management
 
Digital preservation exercises
Digital preservation exercisesDigital preservation exercises
Digital preservation exercises
 
Brief Introduction to Digital Preservation
Brief Introduction to Digital PreservationBrief Introduction to Digital Preservation
Brief Introduction to Digital Preservation
 
Curation of Research Data
Curation of Research DataCuration of Research Data
Curation of Research Data
 
Digital preservation from a records management perspective
Digital preservation from a records management perspectiveDigital preservation from a records management perspective
Digital preservation from a records management perspective
 
The Improving Access to Text (IMPACT) project and other European initiatives
The Improving Access to Text (IMPACT) project and other European initiativesThe Improving Access to Text (IMPACT) project and other European initiatives
The Improving Access to Text (IMPACT) project and other European initiatives
 
Repositories and digital preservation
Repositories and digital preservationRepositories and digital preservation
Repositories and digital preservation
 
Enhancing social tagging with a knowledge organization system
Enhancing social tagging with a knowledge organization systemEnhancing social tagging with a knowledge organization system
Enhancing social tagging with a knowledge organization system
 
Digital Curation 101: Preserve
Digital Curation 101: PreserveDigital Curation 101: Preserve
Digital Curation 101: Preserve
 

Recently uploaded

Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetEnjoy Anytime
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 

Recently uploaded (20)

Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your BudgetHyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
Hyderabad Call Girls Khairatabad ✨ 7001305949 ✨ Cheap Price Your Budget
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 

Developing Institutional Research Data Management Services

  • 1. Developing institutional RDM services Michael Day Digital Curation Centre (DCC) UKOLN, University of Bath DCC Workshop, Cardiff University 14 May 2013
  • 2. Session outline  Managing active data  Storage options  Long-term retention of data  Selection criteria  Data repositories  Finding and citing data  Data registries and metadata  Presentation based on: Sarah Jones, Graham Pryor and Angus Whyte, How to Develop Research Data Management Services – a guide for HEIs (DCC, 2013):  http://www.dcc.ac.uk/resources/how-guides/how-develop-rdm-services  Some slides reused from RDMRose training materials:  http://rdmrose.group.shef.ac.uk/
  • 4. Managing active data: key tasks  Researchers:  Have a duty to ensure that research data is stored securely and backed-up on a regular basis  Have choices (e.g. network drives, laptops, external storage devices, online / cloud-based storage)  Need to take data security seriously  This should be considered as part of the data management planning process  Institutions:  Need to be constantly review data holdings and RDM practices in order to evaluate whether current storage infrastructures are sufficient  May need to make a case for investing in the provision of additional data storage capability  Need procedures for the allocation and management of storage  Need to be flexible, taking account of a diverse range of research contexts and data storage requirements
  • 5. Research data storage  Trend for some HEIs to enhance the capacity of research data storage facilities  Extending capacity of existing filestores (e.g. Bath)  Exploring secure cloud storage  Utilising High Performance Computing facilities  Managing storage  University of Bristol (data.bris) – registered researchers (data stewards) are allocated 5TB storage to manage, e.g. deciding how long data should be kept, who has access, etc.  http://data.blogs.ilrt.org
  • 6. Options for managing active data  Cloud storage options  There may be benefits in terms of costs and expertise  There may also be risks (e.g. loss of control, jurisdictional issues)  Janet Brokerage - promoting the use of cloud and off-site data centre facilities  Academic dropbox-like services  Dropbox is often used for sharing and synching data between machines, but institutions are keen to retain control  Systems developed in-house  Typically developed with an disciplinary focus, e.g. BRISSkit (biomedicine)
  • 7. Selection for the long- term retention of data
  • 8. Selecting data for retention  RCUK, Common Principles on Data Policy (2011):  “Data with acknowledged long-term value should be preserved and remain accessible and usable for future research”  http://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx  Institutions will need to establish clear criteria to guide decisions on what should be kept  It will not be possible to retain everything  Carefully considered selection processes are essential to help prioritise that data that has long-term value  Institutional selection processes will need to take account of:  Data that institutions are legally obliged to retain (or destroy), e.g. for contractual or regulatory reasons  Different disciplinary practices (e.g., some disciplines will have mature data sharing infrastructures and will already deposit data with third party services)  Researcher sensitivities about losing control of data (deposit agreements)
  • 9. Developing guidance on selection  Establishing guidelines, processes and good practice for data selection and deposit can be one of the more challenging aspects of an RDM service  There is a need for buy-in from researchers  There is a need for clarity on what kinds of data are within the remit of an institutional RDM service  There may be a need to apply different levels of curation, e.g. depending on the perceived value of the data accepted
  • 10. DCC selection categories  DCC How to Select and Appraise Research Data for Curation (Whyte and Wilson, 2010) proposes seven main criteria:  Relevance to mission  Scientific or historic value  Uniqueness  Potential for redistribution  Non-replicability  Economic case  Full documentation  http://www.dcc.ac.uk/resources/how-guides/appraise- select-data
  • 12. Data repositories  Focusing on how data will be preserved and made available for others  Main options:  Developing an institutional data repository  Building, where possible, on existing systems, e.g. IR, CRIS, etc.  Essex Research Data demo: http://researchdata.essex.ac.uk/  Liaising with external research data repositories (or data centres)  Often subject based, some UK data centres supported by funding bodies  Providing researchers with information on external services
  • 14. RCUK Common Principles  RCUK, Common Principles on Data Policy (2011):  “To enable research data to be discoverable and effectively re-used by others, sufficient metadata should be recorded and made openly available to enable other researchers to understand the research and re-use potential of the data. Published results should always include information on how to access the supporting data”  Also EPSRC Principle 6  http://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx
  • 15. EPSRC Expectation V  “Research organisations will ensure that appropriately structured metadata describing the research data they hold is published (normally within 12 months of the data being generated) and made freely accessible on the internet; in each case the metadata must be sufficient to allow others to understand what research data exists, why, when and how it was generated, and how to access it. Where the research data referred to in the metadata is a digital object it is expected that the metadata will include use of a robust digital object identifier (For example as available through the DataCite organisation - http://datacite.org).”  http://www.epsrc.ac.uk/about/standards/researchdata/P ages/expectations.aspx May-13 Learning material produced by RDMRose http://www.sheffield.ac.uk/is/research/projects/rdmros e
  • 16. Some questions to consider  What metadata is required to adequately record datasets? What is “sufficient metadata” for discovery and re-use?  Does any of this metadata already exist?  If so, where might it be found?  If not, how can the appropriate metadata be generated or captured?  Will there be a need to share this metadata, e.g. with third-party discovery services? National data services?  If so, what standards exist to support metadata sharing?
  • 17. Examples: UKOLN Scoping Study  Scientific Data Application Profile Scoping Study (UKOLN, 2009)  Building on work undertaken on the Scholarly Works Application Profile (SWAP)  Analysed the metadata used by UK data centres and repositories, selected domain models (e.g. DDI, CCLRC Metadata Model, CIDOC CRM)  Concluded that:  Simple Dublin Core (e.g., as mandated by OAI-PMH) would be insufficient  There was sufficient convergence between the different schemas to suggest that a generic metadata profile could be constructed  A generic metadata profile would benefit interdisciplinary research and institution based services (e.g. IRs)  http://www.ukoln.ac.uk/projects/sdapss/
  • 18. Examples: DataCite metadata (1)  DataCite: Organisation aiming to facilitate easier access to (and citation of) research data, e.g. through the use of persistent identifiers (DOIs) DataCite Metadata Schema (currently v. 2.2, 2011) defines core metadata properties Broadly based on Dublin Core concepts http://schema.datacite.org
  • 19. Examples: DataCite metadata (2)  Mandatory Properties:  Identifier  Creator  Title  Publisher  PublicationYear  Administrative Metadata  LastMetadataUpdate  MetadataVersionNumber  Optional Properties:  Subject  Contributor  Date  Language  ResourceType  AlternateIdentifier  RelatedIdentifier  Size  Format  Version  Rights  Description
  • 20. Examples: University of Oxford  The DaMaRO project at the University of Oxford is developing a metadata schema for its DataFinder (Rumsey, 2012).  A three-tier metadata approach:  Mandatory minimal metadata to enable basic discovery, such as Creator, Title, Publisher, Date, Location, Access terms & conditions  Mandatory contextual metadata (mostly administrative and partly based on EPSRC expectations), such as Funding Agency, Grant Number, Last access request date, Project Information, Data Generation Process, Why the data was generated, Date (range) of data collection, Reasons for embargoes  Optional metadata (including discipline-specific metadata) to enable reuse, such as machine settings and the experimental conditions under which the data were gathered May-13 Learning material produced by RDMRose http://www.sheffield.ac.uk/is/research/projects/rdmros e
  • 21. Examples: University of Essex  RDE Metadata Profile for EPrints  Based on DataCite, INSPIRE, DDI 2.1 and DataShare  Mixture of generic schema and standards specific to social science data  http://data- archive.ac.uk/media/375386/rde_eprints_metadatapr ofile.pdf  Seems to be convergence on layered approach
  • 22. Some practical questions (1)  Technical choices for institutions:  Developing new institutional services, e.g. the approach taken by ANDS: http://www.ands.org.au/guides/metadata-stores- solutions.html  Defining metadata stores by their coverage, the granularity of data that they describe, and the specialisation of their descriptions (e.g. collection-level, object level, local, institutional, national and discipline-specific)  Building upon existing infrastructures, e.g.:  Institutional Repositories  CRIS (e.g. Pure, Symplectic, Converis)
  • 23. Some practical questions (2)  Research Information Management interaction?  There is interest in what RIM standards like CERIF can offer RDM (e.g. potentially richer metadata structures for linking research outputs with organisational groupings and funding streams, some level of buy-in from funding bodies), but implementation  CERIF for Datasets (C4D): http://cerif4datasets.wordpress.com  We need to think about how metadata can be shared with:  Discipline-based repositories and data centres  Emerging national (and international) discovery infrastructures  Australian National Data Service  Uses RIF-CS schema (based on ISO 2146:2010) as a data interchange format  Jisc and DCC are currently exploring the options for collating metadata about research data at national level
  • 25. Data Citation  Issues include (Ball & Duke, 2011a and b):  At what granularity should data be made citeable?  How to credit each contributor in a dataset that is assembled from very many contributions?  Where in a research paper should a data citation be given (e.g. a paper describing a dataset versus subsequent papers using it)?  What to do with frequently updated data? May-13 Learning material produced by RDMRose http://www.sheffield.ac.uk/is/research/projects/rdmros e
  • 26. DataCite  DataCite (http://www.datacite.org) is a not-for-profit organisation that aims to promote and support the sharing of research data  They are developing an infrastructure that supports methods of data citation, discovery, and access  They are currently leveraging the DOI (Digital Object Identifier) infrastructure, which is also used for research articles  They can provide DOIs for datasets  DataCite DOIs have to resolve to a public landing page with information about the dataset and a direct link to it May-13 Learning material produced by RDMRose http://www.sheffield.ac.uk/is/research/projects/rdmros e
  • 27. DataCite  Basic form:  Creator (PublicationYear): Title. Publisher. Identifier  Version and ResourceType are optional extra elements  For citation purposes, DataCite recommends that DOI names are displayed as linkable, permanent URLs  More info in DataCite (2011)  University of Poppleton (2011): Precipitation measurements 1905-2010 taken at Western Bank weather station. Meteorological service, The University of Poppleton. http://dx.doi.org/10.1594/UoP.MS.298 May-13 Learning material produced by RDMRose http://www.sheffield.ac.uk/is/research/projects/rdmros e
  • 28. References  Ball, A., (2009). Scientific Data Application Profile Scoping Study Report. Bath: UKOLN, University of Bath. Retrieved from: http://www.ukoln.ac.uk/projects/sdapss/  Ball, A., & Duke, M. (2011a). Data Citation and Linking. DCC Briefing Papers. Edinburgh: Digital Curation Centre. Retrieved from http://www.dcc.ac.uk/resources/briefing-papers/introduction-curation/data-citation- and-linking  Ball, A., & Duke, M. (2011b). How to Cite Datasets and Link to Publications. DCC How-To Guides. Edinburgh: Digital Curation Centre. Retrieved from http://www.dcc.ac.uk/resources/how-guides/cite-datasets  DataCite (2011). DataCite Metadata Schema for the Publication and Citation of Research Data. Version 2.2. London: DataCite. Retrieved from http://schema.datacite.org/meta/kernel-2.2/doc/DataCite-MetadataKernel_v2.2.pdf. doi:10.5438/0005  Rumsey, S. (2012). Just enough metadata: Metadata for research datasets in institutional data repositories [PowerPoint presentation]. Oxford: The University of Oxford. Retrieved from http://damaro.oucs.ox.ac.uk/docs/Just%20enough%20metadata%20v3-1.pdf May-13 Learning material produced by RDMRose http://www.sheffield.ac.uk/is/research/projects/rdmros e