Concise Preservation by combining Managed
Forgetting and Contextualized Remembering
Olivier Dobberkau (R&D)
T3DD2014
Beyond the page - Giving content a meaning and value
TYPO3 Developer Days, 19/22 June 2014, Eindhoven
About
Olivier Dobberkau
R&D, dkd
President of the TYPO3 Association
@TReverendNeverend
The problem
Welcome to the digital, information age...
…a never ending flood of content!
Technology enables us to produce nearly unlimited data
We are still “hunters and gatherers” somehow
Storage space currently feels “infinite”, but the earth's resources are limited
The pace of technological innovation is increasing, bringing new technologies, formats, and standards at an ever higher frequency

So how do we handle this?
Storage capacity is ever increasing
Prices for storage are falling
Easy - let's keep everything!
But there are a lot more costs:
Retrieval
Maintenance
Indexing
Updates
Deprecated formats
Should we really keep everything as it was created?
“The digital dark age is a possible future
situation where it will be difficult or impossible
to read historical electronic documents and
multimedia, because they have been stored in
an obsolete and obscure file format.” Wikipedia
How do we tackle this?
What is preservation?
“Preservation — The protection of cultural
property through activities that minimize
chemical and physical deterioration and
damage and that prevent loss of informational
content. The primary goal of preservation is to
prolong the existence of cultural property.”
Preservation 101
Preserving a website is not trivial
What do you want to preserve?
Content only?
Content and Design?
How often? Stock prices vs. Company History page
How do you deal with browser differences?
How do you preserve functionality? E.g. insurance fee calculator
The project
Concise Preservation by combining
Managed Forgetting and Contextualized
Remembering
The project
Deliver a framework for intelligent preservation, incl. pilot applications (personal use case, organizational use case) that already bring value to their target groups.
The Project
EU research project
Part of the Seventh Framework Programme (FP7)
Countries involved: Germany, Sweden, Israel, Turkey, Greece, United Kingdom, Italy
Project duration: 2013 to 2016
Core concepts
Synergetic Preservation
Contextualised Remembering
Managed Forgetting
Core values
Preservation value
Memory buoyancy
Memory buoyancy and preservation value
[Diagram, built up over several slides, relating memory buoyancy to preservation value. Labels: Archive or delete; Information not needed; Digital preservation; Forgetting without context; Managed digital preservation; Preservation with context; Preservation with learning]
Use cases
Organizational Preservation
Personal Preservation
Organizational use case
Organizational Preservation
Digital Asset Management
Versioning
Archiving a complete website
Individual genres and their specific requirements
Example: press release
Business case / Value proposition
Creating metrics to actually “measure” the value of content is unique to ForgetIT and will be a USP
Sustainable and integrated tools to manage the process of preservation, which is new to content management systems
The standards and components used (e.g. CMIS, OData, Apache Stanbol) and the newly created tools for TYPO3 CMS will lead to CMS interoperability and thus prevent future loss of content due to technological evolution (see “preventing the digital dark age”)
Content Value Performance Indicators
Potential dimensions to look at:
Production: Effort, Complexity, Versions, …
Inner relevance: References, Page impressions, TYPO3 CMS page rank, Memory buoyancy, …
Outer relevance: Social media relevance, Google page rank, Backlinks, …
“Meaning”: Context, Ontologies, Annotation, …
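One way such indicators could be combined into a single content value is a weighted sum over the four dimensions. The sketch below is only an illustration: the field names, the weights, and the assumption that every indicator is normalized to the range 0..1 are hypothetical and not taken from the ForgetIT project.

```python
from dataclasses import dataclass


@dataclass
class ContentIndicators:
    """Hypothetical indicator record, one value per dimension (each 0..1)."""
    production: float       # e.g. effort, complexity, number of versions
    inner_relevance: float  # e.g. references, page impressions
    outer_relevance: float  # e.g. backlinks, social media relevance
    meaning: float          # e.g. annotations, ontology coverage


# Assumed weights for illustration only; they sum to 1.0.
WEIGHTS = {
    "production": 0.2,
    "inner_relevance": 0.3,
    "outer_relevance": 0.3,
    "meaning": 0.2,
}


def content_value(ind: ContentIndicators) -> float:
    """Return the weighted sum of the four normalized indicators."""
    score = 0.0
    for name, weight in WEIGHTS.items():
        value = getattr(ind, name)
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be normalized to 0..1, got {value}")
        score += weight * value
    return score


press_release = ContentIndicators(
    production=0.5, inner_relevance=1.0, outer_relevance=0.5, meaning=0.0
)
print(round(content_value(press_release), 2))
```

A linear combination keeps the score easy to explain to editors; any real metric would need weights calibrated against actual usage data.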
Why TYPO3 CMS?
Open source
Large installation base
We want to raise awareness of the concept of preservation
Architecture
Technology
Content Management Interoperability Services (CMIS)
Standard allowing interoperability between content management systems
Abstraction layer
Defined domain model
OASIS standard
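To make the idea of an abstraction layer with a defined domain model more concrete, the sketch below maps a CMS-specific file record onto CMIS property names. The `cmis:` property identifiers come from the CMIS domain model; the input record and its field names (`uid`, `name`, `mime_type`, `size`) are hypothetical and do not describe the actual TYPO3 or ForgetIT data structures.

```python
from typing import Any


def to_cmis_properties(record: dict[str, Any]) -> dict[str, Any]:
    """Map a CMS-specific file record to CMIS-style document properties."""
    return {
        "cmis:objectId": f"file-{record['uid']}",   # hypothetical id scheme
        "cmis:name": record["name"],
        "cmis:baseTypeId": "cmis:document",
        "cmis:objectTypeId": "cmis:document",
        "cmis:contentStreamMimeType": record["mime_type"],
        "cmis:contentStreamLength": record["size"],
    }


# Hypothetical file record as a CMS might store it internally.
record = {"uid": 42, "name": "press-release.pdf",
          "mime_type": "application/pdf", "size": 123456}
for key, value in to_cmis_properties(record).items():
    print(f"{key} = {value}")
```

Once every system exposes its content through the same property names, a client can read from any of them without knowing the system-specific schema.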
Semantic web
A web that can be processed by machines
Resource Description Framework (RDF)
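RDF expresses statements as subject, predicate, object triples. The sketch below models two such triples in plain Python and serializes them in N-Triples syntax. The `http://example.org/` identifiers and the `mentions` predicate are placeholders invented for illustration, and the escaping covers only backslashes and double quotes.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str             # IRI
    predicate: str           # IRI
    obj: str                 # IRI, or a plain literal if is_literal is True
    is_literal: bool = False


def to_ntriples(triples: list[Triple]) -> str:
    """Serialize triples as N-Triples lines (minimal escaping only)."""
    lines = []
    for t in triples:
        if t.is_literal:
            escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
            obj = f'"{escaped}"'
        else:
            obj = f"<{t.obj}>"
        lines.append(f"<{t.subject}> <{t.predicate}> {obj} .")
    return "\n".join(lines)


EX = "http://example.org/"  # placeholder namespace
triples = [
    Triple(EX + "press-release/1", EX + "mentions", EX + "company/LEGO"),
    Triple(EX + "press-release/1", "http://purl.org/dc/terms/title",
           "Toy fair announcement", is_literal=True),
]
print(to_ntriples(triples))
```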
Ontologies / Domains
Semantic relations in content
Industry-specific concepts
Geography, time, abstract concepts
Company-related products, events, concepts, ...
This is our set of concepts to annotate content with:
during creation/update flows over time
as the basis for defining value
for future “smart semantic” editing
But what does semantic annotation mean? What does it look like in a press release (tt_news)?
“... to announce that the Global Toy Fair will be held in Nuremberg on February 12th, 2014. LEGO will be presenting its products in Hall ...”
Annotation labels shown on the slide: company, event, common geography, common date, industry concept of a brand
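The press release example can be sketched as stand-off annotations, i.e. labels stored as character ranges next to the unchanged text. The assignment of labels to phrases below is an assumption read off the slide, the class and function names are hypothetical, and the lookup handles only the first occurrence of each phrase and assumes the phrases do not overlap.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    start: int  # offset of the first character
    end: int    # offset one past the last character
    label: str


TEXT = ("... to announce that the Global Toy Fair will be held in Nuremberg "
        "on February 12th, 2014. LEGO will be presenting its products in Hall ...")

# Assumed phrase-to-label mapping, for illustration only.
PHRASES = {
    "Global Toy Fair": "event",
    "Nuremberg": "geography",
    "February 12th, 2014": "date",
    "LEGO": "company",
    "products": "industry concept",
}


def annotate(text: str, phrases: dict[str, str]) -> list[Annotation]:
    """Create one annotation per known phrase found in the text."""
    annotations = []
    for phrase, label in phrases.items():
        start = text.find(phrase)
        if start != -1:
            annotations.append(Annotation(start, start + len(phrase), label))
    return sorted(annotations, key=lambda a: a.start)


def render(text: str, annotations: list[Annotation]) -> str:
    """Show annotations inline as [phrase|label]; the text itself is untouched."""
    out, cursor = [], 0
    for a in annotations:
        out.append(text[cursor:a.start])
        out.append(f"[{text[a.start:a.end]}|{a.label}]")
        cursor = a.end
    out.append(text[cursor:])
    return "".join(out)


print(render(TEXT, annotate(TEXT, PHRASES)))
```

Keeping annotations separate from the text means the press release can be re-annotated later, for example against a newer ontology, without editing its content.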
Suggestions from dkd on how to tackle this:
Treat semantics like teaching the system a foreign (company) language
Implement a semantic “overlay” in the backend, so that content can be annotated during creation/update
Suggest annotations if the backend already knows a word/concept
Use these content annotations to level up DAM in TYPO3 CMS
Integrate semantic search in the backend and frontend
Connect DAM to the Media Mixer from the ForgetIT framework
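The "teach the system a language, then let it suggest" idea could look roughly like the sketch below: a vocabulary that learns terms from editors and proposes annotations for terms it already knows. The `Vocabulary` class and its methods are hypothetical and only illustrate the workflow; they are not an existing TYPO3 or ForgetIT API.

```python
import re


class Vocabulary:
    """Hypothetical store of terms the backend has already learned."""

    def __init__(self) -> None:
        self._concepts: dict[str, str] = {}  # lower-cased term -> concept label

    def learn(self, term: str, concept: str) -> None:
        """Remember a term, e.g. after an editor annotated it manually."""
        self._concepts[term.lower()] = concept

    def suggest(self, text: str) -> list[tuple[str, str]]:
        """Return (matched text, concept) pairs for known terms in the text."""
        suggestions = []
        for term, concept in self._concepts.items():
            pattern = r"\b" + re.escape(term) + r"\b"
            match = re.search(pattern, text, flags=re.IGNORECASE)
            if match:
                suggestions.append((match.group(0), concept))
        return suggestions


vocab = Vocabulary()
vocab.learn("LEGO", "company")
vocab.learn("Nuremberg", "geography")
print(vocab.suggest("LEGO opens a new store in Nuremberg."))
```

Suggestions stay proposals: the editor confirms or rejects them, and every confirmation is another `learn` call that improves later suggestions.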
Text summarization
Generation of visual summaries
• Content Detection analyzes a document to determine which sections are useful in terms of content (e.g. removing the generic menus in a web page; avoids irrelevant material biasing the summary)
• TermRaider extracts representative, weighted terms (words, entities etc.) from documents which can provide a summary (e.g. as a term cloud)
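To illustrate what "representative, weighted terms" can mean, the sketch below ranks the terms of one document by TF-IDF against a small collection. TF-IDF is a stand-in chosen for illustration; it is not a description of how TermRaider weights terms, and the sample documents are invented.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lower-case the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def top_terms(documents: list[str], index: int, k: int = 3) -> list[tuple[str, float]]:
    """Return the k highest TF-IDF weighted terms of documents[index]."""
    tokenized = [tokenize(d) for d in documents]
    doc_freq: Counter = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))  # count each term once per document
    n_docs = len(documents)
    term_freq = Counter(tokenized[index])
    weights = {
        term: count * math.log(n_docs / doc_freq[term])
        for term, count in term_freq.items()
    }
    # Sort by descending weight, then alphabetically for a stable order.
    return sorted(weights.items(), key=lambda item: (-item[1], item[0]))[:k]


docs = [
    "The toy fair opens in Nuremberg. The toy fair presents new toys.",
    "The company presents its annual report.",
    "The annual report lists new products.",
]
for term, weight in top_terms(docs, 0):
    print(f"{term}: {weight:.2f}")
```

Terms that occur in every document, such as "the", receive a weight of zero, so a term cloud built from these weights highlights what is specific to the document.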
Outlook: Semantic text composition
Semantic text editor
• Tool for inferring and suggesting semantic annotations for text while it is being composed
Outlook: Semantic text composition
Semantic text editor components
• Editor
− An extended version of the open-source HTML-based rich text editor CKEditor, which allows for annotating and tracking arbitrary parts of the text
• Natural Language Processing component
− Named entity recognition locates and classifies atomic elements in text into predefined categories such as people, organizations, and locations
− Coreference resolution identifies which words refer to which things in a text
− Relation extraction extracts binary relations from the text being composed
• Linked Open Data component
− Entity disambiguation distinguishes between different entities that have similar or identical names
− Relation extraction searches for relations among entities
− Context inference finds contextual information about entities mentioned in the text
Annotation/Contextualisation of Images
Image analysis
ForgetIT visual analysis technologies demonstrator
• Concept detection and feature extraction
• Visual quality assessment
• Image clustering
• Face detection
http://multimedia.iti.gr/ForgetIT/CostaRica/demonstrator.html
Image feature extraction and concept detection
Image clustering for summarization
Want to support the ForgetIT project?
How to get involved?
Ideas
Code contributions
Test and evaluate
Take our survey! (1/2)
Organizational Preservation
http://bit.ly/U65uL6
Take our survey! (2/2)
Personal Preservation
http://bit.ly/1kJPNhZ
Timeline
2013
• D10.3
• Mockups
• Proof of concept
2014
• Architecture
• FAL
• Semantic UI / Layer
• DAM Dashboard
• Log Aggregation Toolkit
2015
• Content value framework
2016
• Final
D10.1
Research
Analysis
Application Design
Application Logic and Workflow
Mockups
Use Case I: Press release
• Creating a press release
• Adding meta data
• Semantic annotation
Ingest press release
Automatic annotation
• Initiated by user
• Add entity to own ontology
• Color coded according to type
Ingest press release
Manual annotation
• Selection from text or clipboard
• Add entity to own ontology
Use Case II: Preservation-aware digital asset management
• Searching for assets
• Managing digital assets
• Handling digital assets
Summary
Where to find us
http://www.forgetit-project.eu
Contact
@ForgetITProject
Olivier Dobberkau
olivier.dobberkau@dkd.de
Call to Action
Join our efforts in:
creating a semantic layer in TYPO3 CMS
defining the future of DAM within the TYPO3 world
establishing content value measures
preparing TYPO3 and our customers to manage forgetting and preservation of content
Thank you for your attention!

More Related Content

Similar to ForgetIT: Beyond the page: Giving content a meaning and value

OER Rapid Innovation
OER Rapid InnovationOER Rapid Innovation
OER Rapid InnovationJisc
 
The Archives Forum - The National Archives - 02 March 2011
The Archives Forum - The National Archives - 02 March 2011The Archives Forum - The National Archives - 02 March 2011
The Archives Forum - The National Archives - 02 March 2011David F. Flanders
 
Toward FAIR Semantic Resources
Toward FAIR Semantic ResourcesToward FAIR Semantic Resources
Toward FAIR Semantic ResourcesEUDAT
 
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...Georg Rehm
 
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...ACTUONDA
 
F.S. Nucci - Search as an architectural component: searching for a new paradigm
F.S. Nucci - Search as an architectural component: searching for a new paradigmF.S. Nucci - Search as an architectural component: searching for a new paradigm
F.S. Nucci - Search as an architectural component: searching for a new paradigmFIA2010
 
MOVING presentation at the Course in Open Education Design, July 2018, Slovenia
MOVING presentation at the Course in Open Education Design, July 2018, SloveniaMOVING presentation at the Course in Open Education Design, July 2018, Slovenia
MOVING presentation at the Course in Open Education Design, July 2018, SloveniaMOVING Project
 
Kitodo - open source community and service providers hand in hand by Kerstin ...
Kitodo - open source community and service providers hand in hand by Kerstin ...Kitodo - open source community and service providers hand in hand by Kerstin ...
Kitodo - open source community and service providers hand in hand by Kerstin ...Europeana
 
Painless XML Authoring?: How DITA Simplifies XML
Painless XML Authoring?: How DITA Simplifies XMLPainless XML Authoring?: How DITA Simplifies XML
Painless XML Authoring?: How DITA Simplifies XMLScott Abel
 
Serge Ravet - Reinventing the e-Portfolio
Serge Ravet - Reinventing the e-PortfolioSerge Ravet - Reinventing the e-Portfolio
Serge Ravet - Reinventing the e-PortfolioBestr
 
Reinventing the ePortfolio with Open Badges
Reinventing the ePortfolio with Open BadgesReinventing the ePortfolio with Open Badges
Reinventing the ePortfolio with Open BadgesSerge Ravet
 
Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708Sandro D'Elia
 
Presentation of Clemens Neudecker, BnF Information Day
Presentation of Clemens Neudecker, BnF Information DayPresentation of Clemens Neudecker, BnF Information Day
Presentation of Clemens Neudecker, BnF Information DayEuropeana Newspapers
 
Open Badge Passport:
Open Badge Passport: Open Badge Passport:
Open Badge Passport: Serge Ravet
 
20191210 NDLI KEDL2019 Building the dutch digital heritage network
20191210 NDLI KEDL2019 Building the dutch digital heritage network20191210 NDLI KEDL2019 Building the dutch digital heritage network
20191210 NDLI KEDL2019 Building the dutch digital heritage networkEnno Meijers
 
Participatory Media Literacy Uwi2009
Participatory Media Literacy Uwi2009Participatory Media Literacy Uwi2009
Participatory Media Literacy Uwi2009urauch
 
Tech Trans as Learning
Tech Trans as LearningTech Trans as Learning
Tech Trans as LearningVidensemergens
 

Similar to ForgetIT: Beyond the page: Giving content a meaning and value (20)

OER Rapid Innovation
OER Rapid InnovationOER Rapid Innovation
OER Rapid Innovation
 
The Archives Forum - The National Archives - 02 March 2011
The Archives Forum - The National Archives - 02 March 2011The Archives Forum - The National Archives - 02 March 2011
The Archives Forum - The National Archives - 02 March 2011
 
Toward FAIR Semantic Resources
Toward FAIR Semantic ResourcesToward FAIR Semantic Resources
Toward FAIR Semantic Resources
 
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
QURATOR: A Flexible AI Platform for the Adaptive Analysis and Creative Genera...
 
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
Perfect Memory Media Asset Management MAM of Audiovisual Big Data @ Radio 2.0...
 
On Annotation of Video Content for Multimedia Retrieval and Sharing
On Annotation of Video Content for Multimedia  Retrieval and SharingOn Annotation of Video Content for Multimedia  Retrieval and Sharing
On Annotation of Video Content for Multimedia Retrieval and Sharing
 
F.S. Nucci - Search as an architectural component: searching for a new paradigm
F.S. Nucci - Search as an architectural component: searching for a new paradigmF.S. Nucci - Search as an architectural component: searching for a new paradigm
F.S. Nucci - Search as an architectural component: searching for a new paradigm
 
MOVING presentation at the Course in Open Education Design, July 2018, Slovenia
MOVING presentation at the Course in Open Education Design, July 2018, SloveniaMOVING presentation at the Course in Open Education Design, July 2018, Slovenia
MOVING presentation at the Course in Open Education Design, July 2018, Slovenia
 
Kitodo - open source community and service providers hand in hand by Kerstin ...
Kitodo - open source community and service providers hand in hand by Kerstin ...Kitodo - open source community and service providers hand in hand by Kerstin ...
Kitodo - open source community and service providers hand in hand by Kerstin ...
 
Painless XML Authoring?: How DITA Simplifies XML
Painless XML Authoring?: How DITA Simplifies XMLPainless XML Authoring?: How DITA Simplifies XML
Painless XML Authoring?: How DITA Simplifies XML
 
Serge Ravet - Reinventing the e-Portfolio
Serge Ravet - Reinventing the e-PortfolioSerge Ravet - Reinventing the e-Portfolio
Serge Ravet - Reinventing the e-Portfolio
 
Reinventing the ePortfolio with Open Badges
Reinventing the ePortfolio with Open BadgesReinventing the ePortfolio with Open Badges
Reinventing the ePortfolio with Open Badges
 
Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708Summer school bz_fp7research_20100708
Summer school bz_fp7research_20100708
 
Presentation of Clemens Neudecker, BnF Information Day
Presentation of Clemens Neudecker, BnF Information DayPresentation of Clemens Neudecker, BnF Information Day
Presentation of Clemens Neudecker, BnF Information Day
 
Open Badge Passport:
Open Badge Passport: Open Badge Passport:
Open Badge Passport:
 
ICWI_2002 (1).pdf
ICWI_2002 (1).pdfICWI_2002 (1).pdf
ICWI_2002 (1).pdf
 
20191210 NDLI KEDL2019 Building the dutch digital heritage network
20191210 NDLI KEDL2019 Building the dutch digital heritage network20191210 NDLI KEDL2019 Building the dutch digital heritage network
20191210 NDLI KEDL2019 Building the dutch digital heritage network
 
Participatory Media Literacy Uwi2009
Participatory Media Literacy Uwi2009Participatory Media Literacy Uwi2009
Participatory Media Literacy Uwi2009
 
DMI slides
DMI slidesDMI slides
DMI slides
 
Tech Trans as Learning
Tech Trans as LearningTech Trans as Learning
Tech Trans as Learning
 

More from Olivier Dobberkau

Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3Olivier Dobberkau
 
Apache Solr for TYPO3: More than a search engine
Apache Solr for TYPO3: More than a search engineApache Solr for TYPO3: More than a search engine
Apache Solr for TYPO3: More than a search engineOlivier Dobberkau
 
With a little help from my friends (english)
With a little help  from my friends (english)With a little help  from my friends (english)
With a little help from my friends (english)Olivier Dobberkau
 
With a little help from my friends
With a little help from my friendsWith a little help from my friends
With a little help from my friendsOlivier Dobberkau
 
Sonnenschein für ihre Website
Sonnenschein für ihre WebsiteSonnenschein für ihre Website
Sonnenschein für ihre WebsiteOlivier Dobberkau
 
TYPO3 Camp Poznan - Solr Usecases with Hosted Solr
TYPO3 Camp Poznan - Solr Usecases with Hosted SolrTYPO3 Camp Poznan - Solr Usecases with Hosted Solr
TYPO3 Camp Poznan - Solr Usecases with Hosted SolrOlivier Dobberkau
 
ForgetIT Project TYPO3Camp Milano 2014
ForgetIT Project TYPO3Camp Milano 2014ForgetIT Project TYPO3Camp Milano 2014
ForgetIT Project TYPO3Camp Milano 2014Olivier Dobberkau
 
Apache Solr for TYPO3 CMS 101
Apache Solr for TYPO3 CMS 101Apache Solr for TYPO3 CMS 101
Apache Solr for TYPO3 CMS 101Olivier Dobberkau
 
Outside the Box - Panel on CMS at TYPO3 Camp Mallorca
Outside the Box - Panel on CMS at TYPO3 Camp MallorcaOutside the Box - Panel on CMS at TYPO3 Camp Mallorca
Outside the Box - Panel on CMS at TYPO3 Camp MallorcaOlivier Dobberkau
 
Status & Outlook on EXT:solr for TYPO3 CMS
Status & Outlook on EXT:solr for TYPO3 CMSStatus & Outlook on EXT:solr for TYPO3 CMS
Status & Outlook on EXT:solr for TYPO3 CMSOlivier Dobberkau
 
Digital dark age - Are we doing enough to preserve our website heritage?
Digital dark age - Are we doing enough to preserve our website heritage?Digital dark age - Are we doing enough to preserve our website heritage?
Digital dark age - Are we doing enough to preserve our website heritage?Olivier Dobberkau
 
Everything you always wanted to know about search in typo3
Everything you always wanted to know about search in typo3Everything you always wanted to know about search in typo3
Everything you always wanted to know about search in typo3Olivier Dobberkau
 
Alles was-sie-ueber-suche-wissen-wollten
Alles was-sie-ueber-suche-wissen-wolltenAlles was-sie-ueber-suche-wissen-wollten
Alles was-sie-ueber-suche-wissen-wolltenOlivier Dobberkau
 
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3Olivier Dobberkau
 
Das Solr System - Suche nicht nur auf Planet TYPO3
Das Solr System - Suche nicht nur auf Planet TYPO3Das Solr System - Suche nicht nur auf Planet TYPO3
Das Solr System - Suche nicht nur auf Planet TYPO3Olivier Dobberkau
 
How we did it: Apache Solr for TYPO3. Collecting ideas, partners and money in...
How we did it: Apache Solr for TYPO3. Collecting ideas, partners and money in...How we did it: Apache Solr for TYPO3. Collecting ideas, partners and money in...
How we did it: Apache Solr for TYPO3. Collecting ideas, partners and money in...Olivier Dobberkau
 

More from Olivier Dobberkau (20)

Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
Meet TYPO3 Vienna - Solr die Suchmachine für TYPO3
 
Apache Solr for TYPO3: More than a search engine
Apache Solr for TYPO3: More than a search engineApache Solr for TYPO3: More than a search engine
Apache Solr for TYPO3: More than a search engine
 
TYPO3 v8 LTS in the cloud
TYPO3 v8 LTS in the cloudTYPO3 v8 LTS in the cloud
TYPO3 v8 LTS in the cloud
 
With a little help from my friends (english)
With a little help  from my friends (english)With a little help  from my friends (english)
With a little help from my friends (english)
 
With a little help from my friends
With a little help from my friendsWith a little help from my friends
With a little help from my friends
 
TYPO3 & You
TYPO3 & YouTYPO3 & You
TYPO3 & You
 
Sonnenschein für ihre Website
Sonnenschein für ihre WebsiteSonnenschein für ihre Website
Sonnenschein für ihre Website
 
Apache Solr Revisited 2015
Apache Solr Revisited 2015Apache Solr Revisited 2015
Apache Solr Revisited 2015
 
TYPO3 Camp Poznan - Solr Usecases with Hosted Solr
TYPO3 Camp Poznan - Solr Usecases with Hosted SolrTYPO3 Camp Poznan - Solr Usecases with Hosted Solr
TYPO3 Camp Poznan - Solr Usecases with Hosted Solr
 
TYPO3 and CMIS
TYPO3 and CMISTYPO3 and CMIS
TYPO3 and CMIS
 
ForgetIT Project TYPO3Camp Milano 2014
ForgetIT Project TYPO3Camp Milano 2014ForgetIT Project TYPO3Camp Milano 2014
ForgetIT Project TYPO3Camp Milano 2014
 
Apache Solr for TYPO3 CMS 101
Apache Solr for TYPO3 CMS 101Apache Solr for TYPO3 CMS 101
Apache Solr for TYPO3 CMS 101
 
Outside the Box - Panel on CMS at TYPO3 Camp Mallorca
Outside the Box - Panel on CMS at TYPO3 Camp MallorcaOutside the Box - Panel on CMS at TYPO3 Camp Mallorca
Outside the Box - Panel on CMS at TYPO3 Camp Mallorca
 
Status & Outlook on EXT:solr for TYPO3 CMS
Status & Outlook on EXT:solr for TYPO3 CMSStatus & Outlook on EXT:solr for TYPO3 CMS
Status & Outlook on EXT:solr for TYPO3 CMS
 
Digital dark age - Are we doing enough to preserve our website heritage?
Digital dark age - Are we doing enough to preserve our website heritage?Digital dark age - Are we doing enough to preserve our website heritage?
Digital dark age - Are we doing enough to preserve our website heritage?
 
Everything you always wanted to know about search in typo3
Everything you always wanted to know about search in typo3Everything you always wanted to know about search in typo3
Everything you always wanted to know about search in typo3
 
Alles was-sie-ueber-suche-wissen-wollten
Alles was-sie-ueber-suche-wissen-wolltenAlles was-sie-ueber-suche-wissen-wollten
Alles was-sie-ueber-suche-wissen-wollten
 
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
 
Das Solr System - Suche nicht nur auf Planet TYPO3
Das Solr System - Suche nicht nur auf Planet TYPO3Das Solr System - Suche nicht nur auf Planet TYPO3
Das Solr System - Suche nicht nur auf Planet TYPO3
 
How we did it: Apache Solr for TYPO3. Collecting ideas, partners and money in...
How we did it: Apache Solr for TYPO3. Collecting ideas, partners and money in...How we did it: Apache Solr for TYPO3. Collecting ideas, partners and money in...
How we did it: Apache Solr for TYPO3. Collecting ideas, partners and money in...
 

Recently uploaded

Benefits of doing Internet peering and running an Internet Exchange (IX) pres...
Benefits of doing Internet peering and running an Internet Exchange (IX) pres...Benefits of doing Internet peering and running an Internet Exchange (IX) pres...
Benefits of doing Internet peering and running an Internet Exchange (IX) pres...APNIC
 
Zero-day Vulnerabilities
Zero-day VulnerabilitiesZero-day Vulnerabilities
Zero-day Vulnerabilitiesalihassaah1994
 
WordPress by the numbers - Jan Loeffler, CTO WebPros, CloudFest 2024
WordPress by the numbers - Jan Loeffler, CTO WebPros, CloudFest 2024WordPress by the numbers - Jan Loeffler, CTO WebPros, CloudFest 2024
WordPress by the numbers - Jan Loeffler, CTO WebPros, CloudFest 2024Jan Löffler
 
TYPES AND DEFINITION OF ONLINE CRIMES AND HAZARDS
TYPES AND DEFINITION OF ONLINE CRIMES AND HAZARDSTYPES AND DEFINITION OF ONLINE CRIMES AND HAZARDS
TYPES AND DEFINITION OF ONLINE CRIMES AND HAZARDSedrianrheine
 
Vision Forward: Tracing Image Search SEO From Its Roots To AI-Enhanced Horizons
Vision Forward: Tracing Image Search SEO From Its Roots To AI-Enhanced HorizonsVision Forward: Tracing Image Search SEO From Its Roots To AI-Enhanced Horizons
Vision Forward: Tracing Image Search SEO From Its Roots To AI-Enhanced HorizonsRoxana Stingu
 
Presentation2.pptx - JoyPress Wordpress
Presentation2.pptx -  JoyPress WordpressPresentation2.pptx -  JoyPress Wordpress
Presentation2.pptx - JoyPress Wordpressssuser166378
 
Bio Medical Waste Management Guideliness 2023 ppt.pptx
Bio Medical Waste Management Guideliness 2023 ppt.pptxBio Medical Waste Management Guideliness 2023 ppt.pptx
Bio Medical Waste Management Guideliness 2023 ppt.pptxnaveenithkrishnan
 
LESSON 10/ GROUP 10/ ST. THOMAS AQUINASS
LESSON 10/ GROUP 10/ ST. THOMAS AQUINASSLESSON 10/ GROUP 10/ ST. THOMAS AQUINASS
LESSON 10/ GROUP 10/ ST. THOMAS AQUINASSlesteraporado16
 
Check out the Free Landing Page Hosting in 2024
Check out the Free Landing Page Hosting in 2024Check out the Free Landing Page Hosting in 2024
Check out the Free Landing Page Hosting in 2024Shubham Pant
 
Computer 10 Lesson 8: Building a Website
Computer 10 Lesson 8: Building a WebsiteComputer 10 Lesson 8: Building a Website
Computer 10 Lesson 8: Building a WebsiteMavein
 
LESSON 5 GROUP 10 ST. THOMAS AQUINAS.pdf
LESSON 5 GROUP 10 ST. THOMAS AQUINAS.pdfLESSON 5 GROUP 10 ST. THOMAS AQUINAS.pdf
LESSON 5 GROUP 10 ST. THOMAS AQUINAS.pdfmchristianalwyn
 
Introduction to ICANN and Fellowship program by Shreedeep Rayamajhi.pdf
Introduction to ICANN and Fellowship program  by Shreedeep Rayamajhi.pdfIntroduction to ICANN and Fellowship program  by Shreedeep Rayamajhi.pdf
Introduction to ICANN and Fellowship program by Shreedeep Rayamajhi.pdfShreedeep Rayamajhi
 

Recently uploaded (12)

Benefits of doing Internet peering and running an Internet Exchange (IX) pres...
Benefits of doing Internet peering and running an Internet Exchange (IX) pres...Benefits of doing Internet peering and running an Internet Exchange (IX) pres...
Benefits of doing Internet peering and running an Internet Exchange (IX) pres...
 
Zero-day Vulnerabilities

ForgetIT: Beyond the page: Giving content a meaning and value

  • 1. Concise Preservation by combining Managed Forgetting and Contextualized Remembering
  • 2. Olivier Dobberkau (R&D) T3DD2014! Beyond the page - Giving content a meaning and value! TYPO3 Developer Days, 19/22 June 2014, Eindhoven
  • 3. About: Olivier Dobberkau, R&D at dkd, President of the TYPO3 Association, @TReverendNeverend
  • 5. Welcome to the digital, information age... …a never-ending flood of content! Technology enables us to produce nearly unlimited data. We are still „hunters and collectors“ somehow. Currently storage space feels „infinite“, but resources on earth are limited sooner or later. The velocity of innovation/evolution of technology increases, which brings new technologies/formats/standards at an increasing frequency.
 -> So how do we handle this?
  • 6. Storage capacity is ever increasing Prices for storage are falling
  • 7. Easy - let‘s keep everything! But there are many more costs: retrieval, maintenance, indexing, updates, deprecated formats
  • 8. Should we really keep everything as it was created?
  • 9. “The digital dark age is a possible future situation where it will be difficult or impossible to read historical electronic documents and multimedia, because they have been stored in an obsolete and obscure file format.” Wikipedia
  • 10. How do we tackle this?
  • 11. What is preservation? “Preservation: The protection of cultural property through activities that minimize chemical and physical deterioration and damage and that prevent loss of informational content. The primary goal of preservation is to prolong the existence of cultural property.” (Preservation 101)
  • 12. Preserving a website is not trivial. What do you want to preserve? Content only? Content and design? How often? Stock prices vs. company history page. How do you deal with browser differences? How do you preserve functionality, e.g. an insurance fee calculator?
  • 14. Concise Preservation by combining Managed Forgetting and Contextualized Remembering
  • 15. The project: Deliver a framework for intelligent preservation, incl. pilot applications (personal use case, organizational use case) that already bring value to their target groups.
  • 16. The project: EU research project; part of the Seventh Framework Programme; countries involved: Germany, Sweden, Israel, Turkey, Greece, United Kingdom, Italy; project duration: 2013 to 2016
  • 17. Core concepts: Synergetic Preservation, Contextualised Remembering, Managed Forgetting
  • 18. Core values: Preservation value, Memory buoyancy
  • 19. Memory buoyancy and preservation value
  • 20. Memory buoyancy and preservation value: Digital preservation
  • 21. Memory buoyancy and preservation value: Digital preservation; Forgetting without context
  • 22. Memory buoyancy and preservation value: Digital preservation; Forgetting without context; Preservation with learning
  • 23. Memory buoyancy and preservation value: Digital preservation; Forgetting without context; Preservation with context; Preservation with learning
  • 24. Memory buoyancy and preservation value: Digital preservation; Forgetting without context; Managed digital preservation; Preservation with context; Preservation with learning
  • 25. Memory buoyancy and preservation value: Archive or delete; Digital preservation; Forgetting without context; Managed digital preservation; Preservation with context; Preservation with learning
  • 26. Memory buoyancy and preservation value: Archive or delete; Information not needed; Digital preservation; Forgetting without context; Managed digital preservation; Preservation with context; Preservation with learning
  • 27. Use cases: Organizational Preservation, Personal Preservation
  • 28. Organizational use case: Organizational Preservation; Digital Asset Management; versioning; archiving a complete website; individual genres and their specific requirements (example: press release)
  • 29. Business case / Value proposition: Creating metrics to actually „measure“ the value of content is unique to ForgetIT and will be a USP. Sustainable and integrated tools to manage the process of preservation are new to CMSs. The utilized standards (e.g. CMIS, OData, Stanbol) and newly created tools within the context of TYPO3 CMS will lead to CMS interoperability and thus prevent future loss of content due to technological evolution (see „preventing the digital dark age“).
  • 30. Content Value Performance Indicators. Potential dimensions to look at: Production (effort, complexity, versions, …); Inner relevance (references, page impressions, TYPO3 CMS page rank, memory buoyancy); Outer relevance (social media relevance, Google page rank, backlinks, …); „Meaning“ (context, ontologies, annotation, …)
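One way to picture how the four dimensions named on slide 30 (production, inner relevance, outer relevance, meaning) might be folded into a single content value is a weighted mean. The slides do not define ForgetIT's actual metric, so this is only a sketch under stated assumptions: each dimension is already normalised to the range 0 to 1, and the weights and the function name `content_value` are hypothetical.

```python
# Hypothetical weights per dimension; the slides give no actual weighting.
WEIGHTS = {
    "production": 0.2,
    "inner_relevance": 0.3,
    "outer_relevance": 0.3,
    "meaning": 0.2,
}


def content_value(indicators: dict, weights: dict = WEIGHTS) -> float:
    """Return the weighted mean of per-dimension scores.

    Scores are clamped to [0, 1]; a dimension missing from
    `indicators` contributes 0.
    """
    total_weight = sum(weights.values())
    weighted_sum = 0.0
    for dimension, weight in weights.items():
        score = indicators.get(dimension, 0.0)
        score = min(max(score, 0.0), 1.0)  # clamp out-of-range inputs
        weighted_sum += weight * score
    return weighted_sum / total_weight


if __name__ == "__main__":
    press_release = {
        "production": 0.4,
        "inner_relevance": 0.9,
        "outer_relevance": 0.5,
        "meaning": 0.7,
    }
    print(round(content_value(press_release), 3))
```

A weighted mean keeps the score in the same 0 to 1 range as its inputs, which makes it easy to compare items; whether a linear combination is appropriate at all is an open design question that the slides leave to the project's later "content value framework" work.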
  • 31. Why TYPO3 CMS? Open source; large installation base; we want to raise awareness of the concept of preservation
  • 33. Architecture
  • 39. Content Management Interoperability Services (CMIS): standard allowing interoperability between CMSs; abstraction layer; defined domain model; OASIS standard
  • 40. Semantic web: a web that can be processed by machines; Resource Description Framework (RDF)
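RDF expresses statements as subject, predicate, object triples, which is what makes them machine-processable. The following is a minimal sketch using plain Python tuples rather than an RDF library; the `http://example.org/` namespace, the property names, and the helper functions `match` and `to_ntriples` are assumptions made for illustration and do not come from the ForgetIT project.

```python
from typing import Iterable, Iterator, Optional, Tuple

Triple = Tuple[str, str, str]

# Hypothetical namespace, used for illustration only.
EX = "http://example.org/"

TRIPLES = [
    (EX + "press-release-1", EX + "mentions", EX + "LEGO"),
    (EX + "press-release-1", EX + "mentions", EX + "Nuremberg"),
    (EX + "LEGO", EX + "type", EX + "Brand"),
]


def match(store: Iterable[Triple],
          s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> Iterator[Triple]:
    """Yield triples matching the pattern; None acts as a wildcard."""
    for triple in store:
        if all(want is None or want == got
               for want, got in zip((s, p, o), triple)):
            yield triple


def to_ntriples(store: Iterable[Triple]) -> str:
    """Serialise IRI-only triples in N-Triples syntax."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in store)


if __name__ == "__main__":
    # Which resources does the press release mention?
    for _, _, obj in match(TRIPLES, s=EX + "press-release-1", p=EX + "mentions"):
        print(obj)
    print(to_ntriples(TRIPLES))
```

In practice a dedicated RDF library and a shared vocabulary would replace the hand-rolled store; the point here is only the triple shape and pattern matching over it.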
  • 41. Ontologies / Domains: semantic relations in content. Industry-specific concepts; geography, time, abstract concepts; company-related products, events, concepts, ... This is our set of concepts to annotate content with: during creation/update flows; over time; as the basis for defining value; for future „smart semantic“ editing
  • 42. But - what does semantic annotation mean? How does it look to us in a press release (tt_news)? „.. to announce that the Global Toy Fair will be held in Nuremberg on February 12th, 2014. LEGO will be presenting its products in Hall ... “ Annotation labels shown on the slide: company event; common geography; common date; industry concept of a brand
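The press release example can be represented as character-span annotations over the text. This is a sketch only: the pairing of each label with a phrase is an assumption inferred from the order in which labels and phrases appear on the slide, the sentence is quoted with its spelling normalised, and the `Annotation` class and `annotate` function are hypothetical names rather than part of TYPO3 or ForgetIT.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    start: int   # offset of the first character of the span
    end: int     # offset one past the last character
    label: str   # concept type attached to the span


TEXT = ("... to announce that the Global Toy Fair will be held in Nuremberg "
        "on February 12th, 2014. LEGO will be presenting its products in Hall ...")

# Assumed phrase-to-label pairing (inferred from slide order, not confirmed).
GAZETTEER = {
    "Global Toy Fair": "company event",
    "Nuremberg": "common geography",
    "February 12th, 2014": "common date",
    "LEGO": "industry concept of a brand",
}


def annotate(text: str, gazetteer: dict) -> list:
    """Find every exact occurrence of each known phrase and label it."""
    found = []
    for surface, label in gazetteer.items():
        pos = text.find(surface)
        while pos != -1:
            found.append(Annotation(pos, pos + len(surface), label))
            pos = text.find(surface, pos + len(surface))
    return sorted(found, key=lambda a: a.start)


if __name__ == "__main__":
    for ann in annotate(TEXT, GAZETTEER):
        print(f"{TEXT[ann.start:ann.end]!r} -> {ann.label}")
```

Storing offsets rather than modified text keeps the original content untouched, so annotations can be added, corrected, or dropped over time without rewriting the press release itself.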
  • 43. Suggestion from dkd on how to tackle this: treat semantics like teaching the system a foreign (company) language; implement a semantic „overlay“ within the backend, so that annotation can happen during the creation/update of content; suggest annotations if the backend already knows a word/concept; use these content annotations to level up DAM in TYPO3 CMS; integrate semantic search in back end and front end; connect DAM to the Media Mixer from the ForgetIT Framework
  • 44. Text summarization: generation of visual summaries • Content Detection analyzes a document to determine which sections are useful in terms of content (e.g. removing the generic menus in a web page; avoids irrelevant material biasing the summary) • TermRaider extracts representative, weighted terms (words, entities etc.) from documents which can provide a summary (e.g. as a term cloud)
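To illustrate what "representative, weighted terms" can mean, the sketch below ranks the terms of one document with plain TF-IDF against a small corpus. This is a stand-in technique chosen for illustration, not TermRaider's actual scoring; the tokenizer, the smoothing formula, and the function names are assumptions.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list:
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def weighted_terms(doc: str, corpus: list, top_n: int = 5) -> list:
    """Rank the terms of `doc` by TF-IDF computed against `corpus`.

    Uses a smoothed IDF, log((1 + N) / (1 + df)) + 1, so that terms
    absent from the corpus do not cause a division by zero.
    """
    term_counts = Counter(tokenize(doc))
    corpus_tokens = [set(tokenize(d)) for d in corpus]
    n_docs = len(corpus)
    scores = {}
    for term, count in term_counts.items():
        doc_freq = sum(1 for tokens in corpus_tokens if term in tokens)
        idf = math.log((1 + n_docs) / (1 + doc_freq)) + 1.0
        scores[term] = count * idf
    # Highest score first; ties are broken alphabetically.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


if __name__ == "__main__":
    corpus = [
        "the menu and the footer",
        "preservation of content and preservation value",
        "the value of the menu",
    ]
    for term, weight in weighted_terms(corpus[1], corpus, top_n=3):
        print(f"{term}: {weight:.2f}")
```

The resulting weights could drive the font sizes of a term cloud. Running Content Detection first, as the slide describes, matters for this kind of scoring because menu and footer words would otherwise inflate term counts.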
  • 45. Outlook: Semantic text composition. Semantic text editor • Tool for inferring and suggesting semantic annotations for text while it is being composed
  • 46. Outlook: Semantic text composition. Semantic text editor components • Editor − An extended version of the open-source HTML-based rich text editor CKEditor, which allows for annotating and tracking arbitrary parts of the text • Natural Language Processing component − Named entity recognition locates and classifies atomic elements in text into predefined categories such as people, organizations, and locations − Coreference resolution identifies which words refer to which things in a text − Relation extraction extracts binary relations from the text being composed • Linked Open Data component − Entity disambiguation distinguishes between different entities that have similar or identical names − Relation extraction searches for relations among entities − Context inference finds contextual information about entities mentioned in the text
  • 48. Image analysis: ForgetIT visual analysis technologies demonstrator • Concept detection and feature extraction • Visual quality assessment • Image clustering • Face detection http://multimedia.iti.gr/ForgetIT/CostaRica/demonstrator.html
  • 49. Image feature extraction and concept detection
  • 50. Image clustering for summarization
  • 51. Want to support the ForgetIT project? How to get involved?
  • 52. Ideas
  • 53. Code contributions
  • 54. Test and evaluate
  • 55. Take our survey! (1/2) Organizational Preservation http://bit.ly/U65uL6
  • 56. Take our survey! (2/2) Personal Preservation http://bit.ly/1kJPNhZ
  • 58. Timeline: 2013 • D10.3 • Mockups • Proof of concept; 2014 • Architecture • FAL • Semantic UI / Layer • DAM Dashboard • Log Aggregation Toolkit; 2015 • Content value framework; 2016 • Final
  • 59. D10.1: Research Analysis; Application Design; Application Logic and Workflow
  • 61. Use Case I: Press release • Creating a press release • Adding metadata • Semantic annotation
  • 63. Ingest press release: Automatic annotation • Initiated by user • Add entity to own ontology • Color coded according to type
  • 65. Ingest press release: Manual annotation • Selection from text or clipboard • Add entity to own ontology
  • 66. Use Case II: Preservation-aware digital asset management • Searching for assets • Managing digital assets • Handling digital assets
  • 69. Where to find us: http://www.forgetit-project.eu
  • 70. Contact: @ForgetITProject, Olivier Dobberkau, olivier.dobberkau@dkd.de
  • 71. Call to Action: Join our efforts in creating a semantic layer in TYPO3 CMS, defining the future of DAM within the TYPO3 world, establishing content value measures, and preparing TYPO3 and our customers to manage forgetting and preservation of content
  • 72. Thank you for your attention!