SlideShare a Scribd company logo
1 of 60
Download to read offline
Facilitating Reusable Third-Party
Annotations in the Digital Edition
Marijn Koolen
(Royal Netherlands Academy of Arts and Sciences - Humanities Cluster)
Peter Boot
(Huygens ING)
Annotation in Scholarly Editions and Research, 22-02-2019, Wuppertal, Germany
● Annotations in Digital Editions
○ Tend to be restricted to critical notes by creators of the edition
○ Users rarely have support from editions for making their own annotations
● Annotation is a scholarly primitive (Unsworth 2000, Palmer et al. 2009)
○ All scholars make annotations, use them to structure thoughts, gather data
○ Either visible only in private copies, or invisible in shared source materials,
○ Add interpretations, explanations and perspectives
● Annotation is broad but vaguely defined concept
○ “nearly every type of digital research activity in the Humanities today is referred to or connected to
annotating” (Niels-Oliver Walkowski on DARIAH Annotation WG survey, 2017)
Annotation as Scholarly Activity
Sticking to the unwritten rule
● Facilitate third-party annotations in the digital edition:
○ annotations made by researchers unaffiliated to the edition project,
○ contributes to explanatory material already present on the site,
○ Purpose: private study or publish along scholarly article
● Making annotations more a visible part of scholarly communication
○ “Visions of the scholarly web”
● Goal: approach with low threshold for participation
○ For resource providers: tool that is easy to integrate in existing edition
○ For scholars: tool that supports different annotation tasks, allows rich querying/analysing
○ Implementation:
■ W3C Web Annotation data model and protocol (interoperability)
■ Javascript client talking with WA server
Third-Party Annotations
Overview
1. Annotation Digital Editions on the Web
a. The problem of anchoring
b. The problem of semantics
2. Making Web Editions Annotatable
a. Anchors and semantics via RDFa
b. The problem of representation
3. Facilitating Third-Party Annotations
a. The consequences
b. Beyond Digital Editions
Annotating Digital Editions on the Web
The problem of anchoring
● How to anchor annotation to specific location in the edition
● Ensure the annotation addresses a component in the logical information
structure that defines the edition
○ and not a location in an HTML page which is merely one representation of an edited text
Annotating Digital Editions on the Web
● Many open, browser-based tools for social annotation tasks
○ Annotator.js
○ Hypothes.is
○ Dokie.li
○ Pund.it
○ Apache Annotator
● Advantages
○ Annotate online materials
○ Open formats: sharing, collaborating
● Disadvantages
○ Limited knowledge of the structure the annotated object
○ Limited support for using/analysing annotations outside of annotated web page
○ Limited support for annotating multimedia objects
State of the Art in Web-based Annotation
Annotating Digital Editions on the Web
The problem of semantics
● We argue an annotation tool should understand structure of object itself
a. Browser uses HTML representation
i. HTML is layout oriented, no meaningful connection with annotated object
ii. Annotation not robust against changes in HTML representation
b. Multiple websites may have (different) online versions/editions of the same object
i. Annotations all target same object but different URLs
c. Object may have multiple representations
i. Digital edition can have different transcriptions, translations, audio versions
ii. Annotations made on one representation should be accessible for others
d. Resource providers should be able to suggest suitable annotation types for different object
components
Understanding Annotated Object
● Edition provider has:
○ Resources + metadata (e.g. as TEI/XML)
● Transformed to HTML presentation format for web browser
○ Browser (and annotation plugin) only sees presentation information
○ Compare rich semantics of TEI file with poor semantics of HTML representation
Annotating Digital Editions as Web Pages
TEI Header
TEI Body
HTML Version
Making Web Editions Annotatable
Anchors and Semantics via RDFa
● Use RDFa to describe resources in web page
○ Enrich HTML presentation of resource with semantic info on resource
● Develop annotation client that understands RDFa
○ Parse RDFa information in web page to know annotatable components
○ Capture structural semantic information in annotation
Semantic Anchoring via RDFa
Adding Semantics Through RDFa
Adding Semantics Through RDFa
Demo 1
Annotating RDFa-enriched editions on the Web
Making Web Editions Annotatable
The problem of representation
● How to anchor an annotation to specific representation in the edition
● Ensure the information structure is described in sufficient detail to distinguish
○ the edited text or document (the object of editing)
○ its (multiple) representation(s) in the edition
Annotating Digital Editions on the Web
Creative Works and Representation
● Digital Editions can have multiple representations of the same creative work
○ E.g. image scan, transcript, translation
○ Annotations may relate to a specific representation…
■ E.g. a correction or comment on a word in the transcript or translation
○ … or to the abstract creative work...
■ E.g. background information for something referenced in the text
■ Or a code to assign a phrase to a category of interest
○ … or to a combination of representations
■ E.g. linking a phrase in the transcript to a drawing in the page image
● Different structures may be leading in the HTML view
○ E.g. document-centric (pages) and text-centric (sections, paragraphs) structures
○ Annotations made on one structure should be translated to match alternative structure
Annotations on Different Levels
● How can we distinguish between abstract work and representation?
● How can we target annotations at these different levels?
● Which annotations should be shown in which context?
● We created an FRBR-based ontology to distinguish between
○ Editable objects (creative works, parts of works)
○ Edition objects (representations, parts of representations)
● FRBR
○ Functional Requirements for Bibliographic Records
○ Distinguish Work - Manifestation - Expression - Item
○ Van Gogh’s letter is a create Work
○ Diplomatic transcription is an expression of this work
■ (and a creative work in itself)
○ Translation is an expression of this work
■ (and a creative work in itself)
Editable and Edition Domains
Editable Domain
Editable Domain + FRBRoo
Edition Domain + FRBRoo
Editorial Domain
Representing Work and Two Text Versions
Adding Client - Linking External Resource RDF
External Resources in RDF
Demo 2
Annotations in the editable and edition domains
Facilitating Third-Party Annotations
And its consequences
Private, Shared, Public
● Annotations have permissions
○ Private by default, can be shared (once implemented) or made public
○ Importance of private annotations (Bradley, 2012): the role of personal reflection
■ Also, McCarty’s point on the act of making an annotation (“knowing in doing”)
■ Annotations are mainly for structuring your thoughts?
● Annotations for writing vs. annotations for reading
○ Transition from ‘for writing’ (knowing in doing) to ‘for reading’ (knowing in using)
■ I.e. from private/shared to public
○ When does annotator consider annotation of interest for others?
■ E.g. when they’re published alongside article to support arguments made
○ Edit annotations to make them comprehensible for others
Impact
● What are consequences of third-party annotation for scholarship?
○ Publish annotations along scholarly arguments
○ Edition could become living document with ongoing visible communication
■ Esp. within a collaborative project
■ But also more publicly (how to avoid this becoming an impenetrable mess?)
● Feedback
○ Edition owners/maintainers may want to incorporate certain annotations into the edition
○ Third-party annotation to curated annotation/markup
● Editions of famous works or authors may attract much attention
○ Open model: anyone can share anything with everyone
○ Editorial model: public annotations need to be approved (by whom?)
○ Private/shared model: only share with specific collaborators, enable limited conversations,
can’t openly cite annotations
Low Threshold To Participate?
● We want our annotation approach to be easy to adopt by other editions
○ Semantics can be embedded via RDFa without changing the layout
○ The JavaScript client that can be loaded in any RDFa-enriched web page
■ Configurable to suit editor’s/annotator’s needs
○ A Python REST server running Elasticsearch in the background for indexing and retrieval
■ With access permissions per annotation (private, shared, public)
■ Support for AnnotationCollections
● Both available on GitHub
○ Server: https://github.com/marijnkoolen/scholarly-web-annotation-server
○ Client: https://github.com/CLARIAH/scholarly-web-annotation-client
○ Document is minimal and somewhat out-of-date
Adding Client
Facilitating Third-Party Annotations
Beyond Digital Editions
Wrap Up
● We think support for third-party annotation in digital editions is valuable
○ Several difficulties:
■ Changing objects, unstable identifiers
■ Openness comes at a price
○ Our approach has pros and cons
■ Pro: flexible, supports many tasks and multiple modalities, interoperable
■ Cons: complex structure, esp. when using FRBR layers, easy to make mistakes
○ Suggestions for improvement/simplification are welcome
● Plans
○ Set up across CLARIAH infrastructure (funded 2019-2023)
○ Experiment with pilots in different disciplines (historical science, media studies, literary studies,
linguistics, ...)
Anderson, S., T. Blanke, and S. Dunn. (2010). Methodological commons: arts and humanities e-Science fundamentals. Philosophical Transactions
of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, 368(1925), 3779-3796.
Boot, P, Haentjens Dekker, R, Koolen, M, Melgar, L. (2017). Facilitating Fine-grained Open Annotations of Scholarly Sources. In: ​Conference
abstracts Digital Humanities 2017, Montreal​.
Boot, P., Koolen, M (2018). A FRBRoo-based annotation ontology for digital editing. In: Conference abstracts European Association for Digital
Humanities 2018, Galway​.
Bradley, J. (2012). Towards a richer sense of digital annotation: Moving beyond a “media” orientation of the annotation of digital objects. Digital
Humanities Quarterly, 6(2).
Palmer, C. L., Teffeau, L. C., & Pirmann, C. M. (2009). Scholarly Information Practices in the Online Environment. Report commissioned by OCLC
Research.
Unsworth, J. (2001). Scholarly Primitives: what methods do humanities researchers have in common, and how might our tools reflect this. In
Humanities Computing: formal methods, experimental practice symposium, King’s College, London.
Walkowski (2016). The Landscape of Digital Annotation and Its Meaning. Conference on Language Technologies & Digital Humanities, Ljubljana,
2016
References
Thank you!
Questions?
EARMARK
● Extremely Annotational RDF Markup
● Goals:
○ Allow multiple annotators to annotate the same object (overlapping annotations
○ Refer to external entities
● Solution
○ Java application,
○ Works on XML/TEI files,
○ Derives identifier from XML structure, uses XPath and character offsets and range to identify
text elements
○ Allows both standoff annotation and embedding as markup
○ RDF for references to anything in the world
○
Open Web Annotation Using Dokie.li
Source: http://csarven.ca/dokieli-rww
● Approach to enable third-party annotation in digital editions
○ Technical approach is only first step!
● Annotation approach to support fluid nature of annotations
○ Support need for critical distinctions in targeting
● All code on GitHub
○ Server: https://github.com/marijnkoolen/scholarly-web-annotation-server
○ Client: https://github.com/CLARIAH/scholarly-web-annotation-client
○ Document is minimal and somewhat out-of-date
Conclusions

More Related Content

Similar to Facilitating reusable third-party annotations in the digital edition

Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryOpen Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryMarcus Hanwell
 
The Joy of Docs, or, Technical Writing for Developers and Engineers
The Joy of Docs, or, Technical Writing for Developers and EngineersThe Joy of Docs, or, Technical Writing for Developers and Engineers
The Joy of Docs, or, Technical Writing for Developers and EngineersPronovix
 
Documentation An Engineering Problem Unsolved
Documentation  An Engineering Problem UnsolvedDocumentation  An Engineering Problem Unsolved
Documentation An Engineering Problem UnsolvedSchalk Cronjé
 
Semtech web-protege-tutorial
Semtech web-protege-tutorialSemtech web-protege-tutorial
Semtech web-protege-tutorialmatthewhorridge
 
Painless XML Authoring?: How DITA Simplifies XML
Painless XML Authoring?: How DITA Simplifies XMLPainless XML Authoring?: How DITA Simplifies XML
Painless XML Authoring?: How DITA Simplifies XMLScott Abel
 
The big conversation: open annotation in manuscripts and the web
The big conversation: open annotation in manuscripts and the webThe big conversation: open annotation in manuscripts and the web
The big conversation: open annotation in manuscripts and the webMark Barratt
 
Topic Modeling for Learning Analytics Researchers LAK15 Tutorial
Topic Modeling for Learning Analytics Researchers LAK15 TutorialTopic Modeling for Learning Analytics Researchers LAK15 Tutorial
Topic Modeling for Learning Analytics Researchers LAK15 TutorialVitomir Kovanovic
 
Automated interpretability of linked data ontologies: an evaluation within th...
Automated interpretability of linked data ontologies: an evaluation within th...Automated interpretability of linked data ontologies: an evaluation within th...
Automated interpretability of linked data ontologies: an evaluation within th...Nuno Freire
 
Linked Data Patterns
Linked Data PatternsLinked Data Patterns
Linked Data PatternsLeigh Dodds
 
BUD17-TR01: Philosophy of Open Source
BUD17-TR01: Philosophy of Open SourceBUD17-TR01: Philosophy of Open Source
BUD17-TR01: Philosophy of Open SourceLinaro
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotationsAntoine Isaac
 
DockerCon US 2016 - Scaling Open Source operations
DockerCon US 2016 - Scaling Open Source operationsDockerCon US 2016 - Scaling Open Source operations
DockerCon US 2016 - Scaling Open Source operationsArnaud Porterie
 
Philosophy of Open Source - SFO17-TR01
Philosophy of Open Source - SFO17-TR01Philosophy of Open Source - SFO17-TR01
Philosophy of Open Source - SFO17-TR01Linaro
 
Video game controlled vocabulary in wikidata
Video game controlled vocabulary in wikidataVideo game controlled vocabulary in wikidata
Video game controlled vocabulary in wikidatapeterchanws
 
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.FinalHakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.FinalMark Hakkarinen
 
The New Lisbon University - SDL Open Exchange 2015
The New Lisbon University -  SDL Open Exchange 2015The New Lisbon University -  SDL Open Exchange 2015
The New Lisbon University - SDL Open Exchange 2015Paul Filkin
 
Contropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit historyContropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit historyDavid Laniado
 
WIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting OntologiesWIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting Ontologiesdgarijo
 
Linked Open Citation Database (LOC-DB)
Linked Open Citation Database (LOC-DB)Linked Open Citation Database (LOC-DB)
Linked Open Citation Database (LOC-DB)Kai Eckert
 

Similar to Facilitating reusable third-party annotations in the digital edition (20)

Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistryOpen Chemistry, JupyterLab and data: Reproducible quantum chemistry
Open Chemistry, JupyterLab and data: Reproducible quantum chemistry
 
The Joy of Docs, or, Technical Writing for Developers and Engineers
The Joy of Docs, or, Technical Writing for Developers and EngineersThe Joy of Docs, or, Technical Writing for Developers and Engineers
The Joy of Docs, or, Technical Writing for Developers and Engineers
 
Documentation An Engineering Problem Unsolved
Documentation  An Engineering Problem UnsolvedDocumentation  An Engineering Problem Unsolved
Documentation An Engineering Problem Unsolved
 
Semtech web-protege-tutorial
Semtech web-protege-tutorialSemtech web-protege-tutorial
Semtech web-protege-tutorial
 
Painless XML Authoring?: How DITA Simplifies XML
Painless XML Authoring?: How DITA Simplifies XMLPainless XML Authoring?: How DITA Simplifies XML
Painless XML Authoring?: How DITA Simplifies XML
 
The big conversation: open annotation in manuscripts and the web
The big conversation: open annotation in manuscripts and the webThe big conversation: open annotation in manuscripts and the web
The big conversation: open annotation in manuscripts and the web
 
Topic Modeling for Learning Analytics Researchers LAK15 Tutorial
Topic Modeling for Learning Analytics Researchers LAK15 TutorialTopic Modeling for Learning Analytics Researchers LAK15 Tutorial
Topic Modeling for Learning Analytics Researchers LAK15 Tutorial
 
Automated interpretability of linked data ontologies: an evaluation within th...
Automated interpretability of linked data ontologies: an evaluation within th...Automated interpretability of linked data ontologies: an evaluation within th...
Automated interpretability of linked data ontologies: an evaluation within th...
 
Linked Data Patterns
Linked Data PatternsLinked Data Patterns
Linked Data Patterns
 
DBpedia Viewer - LDOW 2014
DBpedia Viewer - LDOW 2014DBpedia Viewer - LDOW 2014
DBpedia Viewer - LDOW 2014
 
BUD17-TR01: Philosophy of Open Source
BUD17-TR01: Philosophy of Open SourceBUD17-TR01: Philosophy of Open Source
BUD17-TR01: Philosophy of Open Source
 
Modelling and exchanging annotations
Modelling and exchanging annotationsModelling and exchanging annotations
Modelling and exchanging annotations
 
DockerCon US 2016 - Scaling Open Source operations
DockerCon US 2016 - Scaling Open Source operationsDockerCon US 2016 - Scaling Open Source operations
DockerCon US 2016 - Scaling Open Source operations
 
Philosophy of Open Source - SFO17-TR01
Philosophy of Open Source - SFO17-TR01Philosophy of Open Source - SFO17-TR01
Philosophy of Open Source - SFO17-TR01
 
Video game controlled vocabulary in wikidata
Video game controlled vocabulary in wikidataVideo game controlled vocabulary in wikidata
Video game controlled vocabulary in wikidata
 
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.FinalHakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
Hakkarinen.OR2016.Enhancing_Citation-only_Repositories.Final
 
The New Lisbon University - SDL Open Exchange 2015
The New Lisbon University -  SDL Open Exchange 2015The New Lisbon University -  SDL Open Exchange 2015
The New Lisbon University - SDL Open Exchange 2015
 
Contropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit historyContropedia: Critical learning through Wikipedia's edit history
Contropedia: Critical learning through Wikipedia's edit history
 
WIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting OntologiesWIDOCO: A Wizard for Documenting Ontologies
WIDOCO: A Wizard for Documenting Ontologies
 
Linked Open Citation Database (LOC-DB)
Linked Open Citation Database (LOC-DB)Linked Open Citation Database (LOC-DB)
Linked Open Citation Database (LOC-DB)
 

More from Marijn Koolen

Recommender Systems NL Meetup
Recommender Systems NL MeetupRecommender Systems NL Meetup
Recommender Systems NL MeetupMarijn Koolen
 
Narrative-Driven Recommendation for Casual Leisure Needs
Narrative-Driven Recommendation for Casual Leisure NeedsNarrative-Driven Recommendation for Casual Leisure Needs
Narrative-Driven Recommendation for Casual Leisure NeedsMarijn Koolen
 
Tools that Encourage Criticism - Leiden University Symposium on Tools Criticism
Tools that Encourage Criticism - Leiden University Symposium on Tools CriticismTools that Encourage Criticism - Leiden University Symposium on Tools Criticism
Tools that Encourage Criticism - Leiden University Symposium on Tools CriticismMarijn Koolen
 
Digital History - Maritieme Carrieres bij de VOC
Digital History - Maritieme Carrieres bij de VOCDigital History - Maritieme Carrieres bij de VOC
Digital History - Maritieme Carrieres bij de VOCMarijn Koolen
 
Hobby horses-and-detail-devils-transparency-in-digital-humanities-research-an...
Hobby horses-and-detail-devils-transparency-in-digital-humanities-research-an...Hobby horses-and-detail-devils-transparency-in-digital-humanities-research-an...
Hobby horses-and-detail-devils-transparency-in-digital-humanities-research-an...Marijn Koolen
 
Narrative-Driven Recommendation for Casual Leisure Needs
Narrative-Driven Recommendation for Casual Leisure NeedsNarrative-Driven Recommendation for Casual Leisure Needs
Narrative-Driven Recommendation for Casual Leisure NeedsMarijn Koolen
 
Scholary Web Annotation - HuC Live 2018
Scholary Web Annotation - HuC Live 2018Scholary Web Annotation - HuC Live 2018
Scholary Web Annotation - HuC Live 2018Marijn Koolen
 
A hands-on approach to digital tool criticism: Tools for (self-)reflection
A hands-on approach to digital tool criticism: Tools for (self-)reflectionA hands-on approach to digital tool criticism: Tools for (self-)reflection
A hands-on approach to digital tool criticism: Tools for (self-)reflectionMarijn Koolen
 
Search in Research, Let's Make it More Complex!
Search in Research, Let's Make it More Complex!Search in Research, Let's Make it More Complex!
Search in Research, Let's Make it More Complex!Marijn Koolen
 
Data Scopes - Towards transparent data research in digital humanities (Digita...
Data Scopes - Towards transparent data research in digital humanities (Digita...Data Scopes - Towards transparent data research in digital humanities (Digita...
Data Scopes - Towards transparent data research in digital humanities (Digita...Marijn Koolen
 
Lessons Learned from a Digital Tool Criticism Workshop
Lessons Learned from a Digital Tool Criticism WorkshopLessons Learned from a Digital Tool Criticism Workshop
Lessons Learned from a Digital Tool Criticism WorkshopMarijn Koolen
 

More from Marijn Koolen (12)

Recommender Systems NL Meetup
Recommender Systems NL MeetupRecommender Systems NL Meetup
Recommender Systems NL Meetup
 
Narrative-Driven Recommendation for Casual Leisure Needs
Narrative-Driven Recommendation for Casual Leisure NeedsNarrative-Driven Recommendation for Casual Leisure Needs
Narrative-Driven Recommendation for Casual Leisure Needs
 
Tools that Encourage Criticism - Leiden University Symposium on Tools Criticism
Tools that Encourage Criticism - Leiden University Symposium on Tools CriticismTools that Encourage Criticism - Leiden University Symposium on Tools Criticism
Tools that Encourage Criticism - Leiden University Symposium on Tools Criticism
 
Digital History - Maritieme Carrieres bij de VOC
Digital History - Maritieme Carrieres bij de VOCDigital History - Maritieme Carrieres bij de VOC
Digital History - Maritieme Carrieres bij de VOC
 
Tool criticism
Tool criticismTool criticism
Tool criticism
 
Hobby horses-and-detail-devils-transparency-in-digital-humanities-research-an...
Hobby horses-and-detail-devils-transparency-in-digital-humanities-research-an...Hobby horses-and-detail-devils-transparency-in-digital-humanities-research-an...
Hobby horses-and-detail-devils-transparency-in-digital-humanities-research-an...
 
Narrative-Driven Recommendation for Casual Leisure Needs
Narrative-Driven Recommendation for Casual Leisure NeedsNarrative-Driven Recommendation for Casual Leisure Needs
Narrative-Driven Recommendation for Casual Leisure Needs
 
Scholary Web Annotation - HuC Live 2018
Scholary Web Annotation - HuC Live 2018Scholary Web Annotation - HuC Live 2018
Scholary Web Annotation - HuC Live 2018
 
A hands-on approach to digital tool criticism: Tools for (self-)reflection
A hands-on approach to digital tool criticism: Tools for (self-)reflectionA hands-on approach to digital tool criticism: Tools for (self-)reflection
A hands-on approach to digital tool criticism: Tools for (self-)reflection
 
Search in Research, Let's Make it More Complex!
Search in Research, Let's Make it More Complex!Search in Research, Let's Make it More Complex!
Search in Research, Let's Make it More Complex!
 
Data Scopes - Towards transparent data research in digital humanities (Digita...
Data Scopes - Towards transparent data research in digital humanities (Digita...Data Scopes - Towards transparent data research in digital humanities (Digita...
Data Scopes - Towards transparent data research in digital humanities (Digita...
 
Lessons Learned from a Digital Tool Criticism Workshop
Lessons Learned from a Digital Tool Criticism WorkshopLessons Learned from a Digital Tool Criticism Workshop
Lessons Learned from a Digital Tool Criticism Workshop
 

Recently uploaded

Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 

Recently uploaded (20)

Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 

Facilitating reusable third-party annotations in the digital edition

  • 1. Facilitating Reusable Third-Party Annotations in the Digital Edition Marijn Koolen (Royal Netherlands Academy of Arts and Sciences - Humanities Cluster) Peter Boot (Huygens ING) Annotation in Scholarly Editions and Research, 22-02-2019, Wuppertal, Germany
  • 2. ● Annotations in Digital Editions ○ Tend to be restricted to critical notes by creators of the edition ○ Users rarely have support from editions for making their own annotations ● Annotation is a scholarly primitive (Unsworth 2000, Palmer et al. 2009) ○ All scholars make annotations, use them to structure thoughts, gather data ○ Either visible only in private copies, or invisible in shared source materials, ○ Add interpretations, explanations and perspectives ● Annotation is broad but vaguely defined concept ○ “nearly every type of digital research activity in the Humanities today is referred to or connected to annotating” (Niels-Oliver Walkowski on DARIAH Annotation WG survey, 2017) Annotation as Scholarly Activity
  • 3. Sticking to the unwritten rule
  • 4. ● Facilitate third-party annotations in the digital edition: ○ annotations made by researchers unaffiliated to the edition project, ○ contributes to explanatory material already present on the site, ○ Purpose: private study or publish along scholarly article ● Making annotations more a visible part of scholarly communication ○ “Visions of the scholarly web” ● Goal: approach with low threshold for participation ○ For resource providers: tool that is easy to integrate in existing edition ○ For scholars: tool that supports different annotation tasks, allows rich querying/analysing ○ Implementation: ■ W3C Web Annotation data model and protocol (interoperability) ■ Javascript client talking with WA server Third-Party Annotations
  • 5. Overview 1. Annotation Digital Editions on the Web a. The problem of anchoring b. The problem of semantics 2. Making Web Editions Annotatable a. Anchors and semantics via RDFa b. The problem of representation 3. Facilitating Third-Party Annotations a. The consequences b. Beyond Digital Editions
  • 6. Annotating Digital Editions on the Web The problem of anchoring
  • 7. ● How to anchor annotation to specific location in the edition ● Ensure the annotation addresses a component in the logical information structure that defines the edition ○ and not a location in an HTML page which is merely one representation of an edited text Annotating Digital Editions on the Web
  • 8. ● Many open, browser-based tools for social annotation tasks ○ Annotator.js ○ Hypothes.is ○ Dokie.li ○ Pund.it ○ Apache Annotator ● Advantages ○ Annotate online materials ○ Open formats: sharing, collaborating ● Disadvantages ○ Limited knowledge of the structure the annotated object ○ Limited support for using/analysing annotations outside of annotated web page ○ Limited support for annotating multimedia objects State of the Art in Web-based Annotation
  • 9.
  • 10. Annotating Digital Editions on the Web The problem of semantics
  • 11. ● We argue an annotation tool should understand structure of object itself a. Browser uses HTML representation i. HTML is layout oriented, no meaningful connection with annotated object ii. Annotation not robust against changes in HTML representation b. Multiple websites may have (different) online versions/editions of the same object i. Annotations all target same object but different URLs c. Object may have multiple representations i. Digital edition can have different transcriptions, translations, audio versions ii. Annotations made on one representation should be accessible for others d. Resource providers should be able to suggest suitable annotation types for different object components Understanding Annotated Object
  • 12. ● Edition provider has: ○ Resources + metadata (e.g. as TEI/XML) ● Transformed to HTML presentation format for web browser ○ Browser (and annotation plugin) only sees presentation information ○ Compare rich semantics of TEI file with poor semantics of HTML representation Annotating Digital Editions as Web Pages
  • 16.
  • 17.
  • 18.
  • 19. Making Web Editions Annotatable Anchors and Semantics via RDFa
  • 20. ● Use RDFa to describe resources in web page ○ Enrich HTML presentation of resource with semantic info on resource ● Develop annotation client that understands RDFa ○ Parse RDFa information in web page to know annotatable components ○ Capture structural semantic information in annotation Semantic Anchoring via RDFa
  • 23.
  • 24.
  • 25. Demo 1 Annotating RDFa-enriched editions on the Web
  • 26. Making Web Editions Annotatable The problem of representation
  • 27. ● How to anchor an annotation to specific representation in the edition ● Ensure the information structure is described in sufficient detail to distinguish ○ the edited text or document (the object of editing) ○ its (multiple) representation(s) in the edition Annotating Digital Editions on the Web
  • 28. Creative Works and Representation ● Digital Editions can have multiple representations of the same creative work ○ E.g. image scan, transcript, translation ○ Annotations may relate to a specific representation… ■ E.g. a correction or comment on a word in the transcript or translation ○ … or to the abstract creative work... ■ E.g. background information for something referenced in the text ■ Or a code to assign a phrase to a category of interest ○ … or to a combination of representations ■ E.g. linking a phrase in the transcript to a drawing in the page image ● Different structures may be leading in the HTML view ○ E.g. document-centric (pages) and text-centric (sections, paragraphs) structures ○ Annotations made on one structure should be translated to match alternative structure
  • 29. Annotations on Different Levels ● How can we distinguish between abstract work and representation? ● How can we target annotations at these different levels? ● Which annotations should be shown in which context?
  • 30. ● We created an FRBR-based ontology to distinguish between ○ Editable objects (creative works, parts of works) ○ Edition objects (representations, parts of representations) ● FRBR ○ Functional Requirements for Bibliographic Records ○ Distinguish Work - Manifestation - Expression - Item ○ Van Gogh’s letter is a create Work ○ Diplomatic transcription is an expression of this work ■ (and a creative work in itself) ○ Translation is an expression of this work ■ (and a creative work in itself) Editable and Edition Domains
  • 35. Representing Work and Two Text Versions
  • 36.
  • 37. Adding Client - Linking External Resource RDF
  • 39. Demo 2 Annotations in the editable and edition domains
  • 41. Private, Shared, Public ● Annotations have permissions ○ Private by default, can be shared (once implemented) or made public ○ Importance of private annotations (Bradley, 2012): the role of personal reflection ■ Also, McCarty’s point on the act of making an annotation (“knowing in doing”) ■ Annotations are mainly for structuring your thoughts? ● Annotations for writing vs. annotations for reading ○ Transition from ‘for writing’ (knowing in doing) to ‘for reading’ (knowing in using) ■ I.e. from private/shared to public ○ When does annotator consider annotation of interest for others? ■ E.g. when they’re published alongside article to support arguments made ○ Edit annotations to make them comprehensible for others
  • 42. Impact ● What are consequences of third-party annotation for scholarship? ○ Publish annotations along scholarly arguments ○ Edition could become living document with ongoing visible communication ■ Esp. within a collaborative project ■ But also more publicly (how to avoid this becoming an impenetrable mess?) ● Feedback ○ Edition owners/maintainers may want to incorporate certain annotations into the edition ○ Third-party annotation to curated annotation/markup ● Editions of famous works or authors may attract much attention ○ Open model: anyone can share anything with everyone ○ Editorial model: public annotations need to be approved (by whom?) ○ Private/shared model: only share with specific collaborators, enable limited conversations, can’t openly cite annotations
  • 43. Low Threshold To Participate? ● We want our annotation approach to be easy to adopt by other editions ○ Semantics can be embedded via RDFa without changing the layout ○ The JavaScript client that can be loaded in any RDFa-enriched web page ■ Configurable to suit editor’s/annotator’s needs ○ A Python REST server running Elasticsearch in the background for indexing and retrieval ■ With access permissions per annotation (private, shared, public) ■ Support for AnnotationCollections ● Both available on GitHub ○ Server: https://github.com/marijnkoolen/scholarly-web-annotation-server ○ Client: https://github.com/CLARIAH/scholarly-web-annotation-client ○ Document is minimal and somewhat out-of-date
  • 46.
  • 47.
  • 48.
  • 49.
  • 50.
  • 51. Wrap Up ● We think support for third-party annotation in digital editions is valuable ○ Several difficulties: ■ Changing objects, unstable identifiers ■ Openness comes at a price ○ Our approach has pros and cons ■ Pro: flexible, supports many tasks and multiple modalities, interoperable ■ Cons: complex structure, esp. when using FRBR layers, easy to make mistakes ○ Suggestions for improvement/simplification are welcome ● Plans ○ Set up across CLARIAH infrastructure (funded 2019-2023) ○ Experiment with pilots in different disciplines (historical science, media studies, literary studies, linguistics, ...)
  • 52. Anderson, S., T. Blanke, and S. Dunn. (2010). Methodological commons: arts and humanities e-Science fundamentals. Philosophical Transactions of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, 368(1925), 3779-3796. Boot, P, Haentjens Dekker, R, Koolen, M, Melgar, L. (2017). Facilitating Fine-grained Open Annotations of Scholarly Sources. In: ​Conference abstracts Digital Humanities 2017, Montreal​. Boot, P., Koolen, M (2018). A FRBRoo-based annotation ontology for digital editing. In: Conference abstracts European Association for Digital Humanities 2018, Galway​. Bradley, J. (2012). Towards a richer sense of digital annotation: Moving beyond a “media” orientation of the annotation of digital objects. Digital Humanities Quarterly, 6(2). Palmer, C. L., Teffeau, L. C., & Pirmann, C. M. (2009). Scholarly Information Practices in the Online Environment. Report commissioned by OCLC Research. Unsworth, J. (2001). Scholarly Primitives: what methods do humanities researchers have in common, and how might our tools reflect this. In Humanities Computing: formal methods, experimental practice symposium, King’s College, London. Walkowski (2016). The Landscape of Digital Annotation and Its Meaning. Conference on Language Technologies & Digital Humanities, Ljubljana, 2016 References
  • 54. EARMARK ● Extremely Annotational RDF Markup ● Goals: ○ Allow multiple annotators to annotate the same object (overlapping annotations ○ Refer to external entities ● Solution ○ Java application, ○ Works on XML/TEI files, ○ Derives identifier from XML structure, uses XPath and character offsets and range to identify text elements ○ Allows both standoff annotation and embedding as markup ○ RDF for references to anything in the world ○
  • 55.
  • 56.
  • 57. Open Web Annotation Using Dokie.li Source: http://csarven.ca/dokieli-rww
  • 58.
  • 59.
  • 60. ● Approach to enable third-party annotation in digital editions ○ Technical approach is only first step! ● Annotation approach to support fluid nature of annotations ○ Support need for critical distinctions in targeting ● All code on GitHub ○ Server: https://github.com/marijnkoolen/scholarly-web-annotation-server ○ Client: https://github.com/CLARIAH/scholarly-web-annotation-client ○ Document is minimal and somewhat out-of-date Conclusions