Following the concept of human memory Forget IT aims to create a framework which will bring “managed forgetting” to TYPO3 CMS. It will provide semantic annotation, intelligent preservation and managed archiving of content objects. Learn what dkd plans for 2014 and how you can contribute.
While preservation of digital content is now well established in memory institutions such as national libraries and archives, it is still in its infancy in most other organizations, and even more so for personal content. ForgetIT combines three new concepts to ease the adoption of preservation in the personal and organizational context.
Managed Forgetting:
Managed Forgetting models resource selection as a function of attention and significance dynamics. It is inspired by the important role of forgetting in human memory and focuses on characteristic signals of reduction in salience.
Synergetic Preservation:
Synergetic Preservation crosses the chasm that exists between active information use and preservation management by making intelligent preservation processes an integral part of the content lifecycle in information management.
Contextualized Remembering:
Contextualized Remembering targets keeping preserved content meaningful and useful. It will be based on a process of dynamic evolution-aware contextualization.
Impact on TYPO3 CMS:
Together with the TYPO3 community and selected pilot customers, dkd will work on establishing the respective extensions to provide these concepts to TYPO3 CMS and its user base.
Olivier will introduce you the project, its concepts and the framework architecture. The past year has been used to define these and a solid foundation was laid.
We elaborated the design and functional requirements by using two use cases (I. Press release, II. DAM integration into the backend).
The current year in the project will be used to create a first and working implementation.
What does this mean for you?
After a short break, a joint brainstorming about how you can be involved and what potential benefits would be, shall take place.
Things to look at will be:
* the value of content objects
* semantic annotation and contextualization
* memory buoyancy, allowing mechanics to forget content over time
* utilization of open standards like CMIS, ODATA, Stanbol
5. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Welcome to the digital, information age...
…a never ending flood of content!
Technology enables us to produce nearly unlimited data
We are still „hunters and collectors“ somehow
Currently storage space feels to be „infinite“, but resources on
earth are limited sooner or later
Velocity of innovation/evolution of technology increases, which
brings new technology/formats/standard at an increasing
frequency
-> so how do we handle this?
9. “The digital dark age is a possible future
situation where it will be difficult or impossible
to read historical electronic documents and
multimedia, because they have been stored in
an obsolete and obscure file format.” Wikipedia
11. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
What is preservation?
“Preservation — The protection of cultural
property through activities that minimize
chemical and physical deterioration and
damage and that prevent loss of informational
content. The primary goal of preservation is to
prolong the existence of cultural property.”
Preservation 101
12. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Preserving a website is not trivial
What do want you preserve?
Content only?
Content and Design?
How often? Stock prices vs. Company History page
How do you deal with browser differences?
How do you preserve functionality? E.g. insurance fee calculator
15. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
The project
Deliver a framework for intelligent
preservation, incl. pilot
applications (personal use case,
organizational use case) that
already bring value to their target
groups.
16. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
The Project
EU research project
Part of the Seventh framework
programme
Countries involved : Germany,
Sweden, Israel, Turkey,
Greece, United Kingdom, Italy
Project duration: 2013/2016
19. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Memory buoyancy and preservation value
20. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Memory buoyancy and preservation value
Digital preservation
21. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Memory buoyancy and preservation value
Digital preservation
Forgetting without context
22. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Memory buoyancy and preservation value
Digital preservation
Forgetting without context
Preservation with learning
23. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Memory buoyancy and preservation value
Digital preservation
Forgetting without context Preservation with context
Preservation with learning
24. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Memory buoyancy and preservation value
Digital preservation
Forgetting without context
Managed digital preservation
Preservation with context
Preservation with learning
25. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Memory buoyancy and preservation value
Archive or delete
Digital preservation
Forgetting without context
Managed digital preservation
Preservation with context
Preservation with learning
26. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Memory buoyancy and preservation value
Archive or delete
Information not neededDigital preservation
Forgetting without context
Managed digital preservation
Preservation with context
Preservation with learning
27. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Use cases
Organizational
Preservation
Personal
Preservation
28. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Organizational use case
Organizational
Preservation
Digital Asset Management
Versioning
Archiving a complete Website
Individual genres and their
specific requirements
Example: Press Release
29. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Business case / Value preposition
Creating metrics to actually „measure“ the value of content is unique
to ForgetIT and will be a USP
Sustainable and integrated tools to manage the process of
preservation, which is new to CMS systems
The utilized standards (e.g. CMIS, ODATA, STANBOL, etc.) and
newly created tools within the context of TYPO3 CMS will lead to
CMS interoperability and thus prevent future loss of content due to
technological evolution (see „preventing the digital dark age“)
30. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Content Value Performance Indicators:
Potential dimensions to look at:
Production Inner relevance Outer relevance „Meaning"
Effort References
Social Media
relevance
Context
Complexity
Page
impressions
Google page
rank
Ontologies
Versions
TYPO3 CMS
page rank
Backlinks Annotation
…
Memory
Buoyancy
… …
…
31. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Why TYPO3 CMS?
Open source
large base of installation
Want to create awareness on
the concept of preservation
39. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Content Management Interoperability Services
(CMIS)
Standard allowing
interoperability between CMS
Abstraction layer
Defined domain model
OASIS Standard
40. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Semantic web
A web that can be processed by
machines
Resource Description
Framework (RDF)
41. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Ontologies / Domains
Semantic relations in Content
industry
specific
concepts
!
geography,
time,
abstract concepts
!
!
company related
products, events,
concepts, ...
This is our set of concepts
to annotate content with!
during creation/update
flows over time
as the basis for defining
value
future „smart semantic“
editing
42. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
But - what does semantic annotation mean? How does it
look to us in a press release (tt_news)?
„.. to announce, that the Global Toy fare will be held in Nuremberg on
February 12th, 2014. LEGO will be presenting it products in Hall ... “
company
event
common
geography
common
date
industry concept
of a brand
43. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Suggestion how to tackle this from dkd:
Treat semantics like learning the system a foreign (company)
language
Implementing a semantic „overlay“ within the backend, so that
during the creation/update of content annotation can happen
Suggest annotations if the backend already knows a word/concept
Using these content annotations to level up DAM in TYPO3CMS
Integrating semantic search in back end and front end
Connect DAM to the Media Mixer from ForgetIT Framework
44. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Text summarization
Generation of visual summaries!
• Content Detection analyzes a
document to determine which
sections are useful in terms of
content (e.g. removing the generic
menus in a web page; avoids
irrelevant material biasing the
summary)!
• TermRaider extracts representative,
weighted terms (words, entities
etc.) from documents which can
provide a summary (e.g. as a term
cloud)
45. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Outlook: Semantic text composition
Semantic text editor!
• Tool for inferring and suggesting semantic annotations for text while it
is being composed
46. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Outlook: Semantic text composition
Semantic text editor components!
• Editor!
− An extended version of the open-source HTML-based rich text editor CKEditor,
which allows for annotating and tracking arbitrary parts of the text !
• Natural Language Processing component!
− Named entity recognition locates and classifies atomic elements in text into
predefined categories such as people, organizations, and locations!
− Coreference resolution identifies which words refer to which things in a text!
− Relation extraction extracts binary relations from the text being composed!
• Linked Open Data component!
− Entity disambiguation distinguishes between different entities that have similar
or identical names!
− Relation extraction searches for relations among entities!
− Context inference finds contextual information about entities mentioned in the
text
61. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Use Case I: Press release
Use case I: Press release
• Creating a press release
• Adding meta data
• Semantic annotation
63. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Ingest press release
Automatic annotation
• Initiated by user
• Add entity to own ontology
• Color coded according to type
65. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Ingest press release
Manual annotation
• Selection from text or clipboard
• Add entity to own ontology
66. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Use Case II: Preservation-aware digital asset
management
Use Case II: Preservation-aware
digital asset management
• Searching for assets
• Managing digital assets
• Handling digital assets
71. TYPO3 Developer Days, 19/22 June 2014, Eindhoven
Call to Action
Join our efforts in creating:
a semantic layer in TYPO3 CMS
defining the future of DAM within the TYPO3 world
establishing content value measures
preparing TYPO3 and our customers to manage forgetting and
preservation of content