Addressing and annotating multimedia fragments
Raphaël Troncy <raphael.troncy@eurecom.fr>
  10/04/2012 -   Séminaire Muséologie - Le Web devient audiovisuel   -2
Use Case

 Aidem received on her Facebook wall a status message containing a Media Fragment URI
    Use a ‘#’!
    Highlight a video sequence
    Highlight a region to pay attention to


What are Media Fragments?




[Figure: a timeline t marked at 0, 20 and 35, with the temporal media fragment between 20 and 35; a spatial media fragment; a track media fragment]




Temporal Media Fragments



[Figure: player timeline showing the original resource length, the fragment beginning, the playback progress and the fragment end]

Spatial Media Fragments


[Figure: video frame with the highlighted fragment surrounded by a semi-opaque overlay]




Media Fragments Use Cases

 Bookmark / Share parts (fragments) of
  audio/video content
 Annotate media fragments
 Mash-ups
 Conserve bandwidth


  http://www.w3.org/TR/media-frags-reqs/



Requirements
 r01: Temporal fragments:
    a clipping along the time dimension from a start to an end time that
     are within the duration of the media resource

 r02: Spatial fragments:
    a clipping of an image region; only rectangular regions are considered

 r03: Track fragments:
    a track as exposed by a container format of the media resource

 r04: Named fragments:
    a temporal media fragment that has been given a name through
     some sort of annotation mechanism




Media URIs

 Using URI query part:
  http://www.example.org/video.ogv?t=60,100

 Using URI fragment part:
  http://www.example.org/video.ogv#t=60,100

 Mixing both:
  http://www.example.org/video.ogv?t=60,100#t=10,15
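
A minimal sketch, assuming Python's standard urllib.parse module, of how the three URI forms above can be taken apart. The names parse_media_uri and parse_npt_range are hypothetical, and only plain seconds are handled for the t dimension (the full syntax also allows clock-style values and other time formats).

```python
from urllib.parse import urlsplit, parse_qsl

# Dimensions named in this deck; anything else in the URI is ignored by this sketch.
MEDIA_DIMENSIONS = {"t", "xywh", "track", "id"}


def parse_media_uri(uri):
    """Return the media dimensions found in the query part and in the fragment part."""
    parts = urlsplit(uri)

    def pairs(component):
        # If a dimension is repeated, the last occurrence wins.
        return {
            key: value
            for key, value in parse_qsl(component, keep_blank_values=True)
            if key in MEDIA_DIMENSIONS
        }

    return {"query": pairs(parts.query), "fragment": pairs(parts.fragment)}


def parse_npt_range(value):
    """Turn a 't' value such as '60,100', '60' or ',100' into (start, end) seconds."""
    if value.startswith("npt:"):
        value = value[len("npt:"):]
    start, _, end = value.partition(",")
    # A missing start means 0; a missing end means "until the end of the media".
    return (float(start) if start else 0.0, float(end) if end else None)


dims = parse_media_uri("http://www.example.org/video.ogv?t=60,100#t=10,15")
print(dims["query"], dims["fragment"])
print(parse_npt_range(dims["fragment"]["t"]))
```

The query dimensions travel to the server with the request, while the fragment dimensions stay with the user agent unless it chooses to translate them into request headers.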



URI Fragments vs. URI Queries
#t=20,30                                                  | ?t=20,30
secondary resource, notion of context                     | primary resource, no notion of context
extraction needs to be expressible in byte ranges         | no adaptation restrictions
no provisions for communicating fragments to the server   | key-value pairs are sent to the server
potentially cacheable                                     | not cacheable
 The media fragment URI syntax can be used
  for URI queries
 We will focus on URI fragments
Media Fragments Resolution

 For the URI query part:
   The media file is only processed on server side
   The UA receives a new video file

 For the URI fragment part:
    A smart UA may strip out the fragment definition and
     encode it into custom HTTP headers (Range header)
    Media Fragments-aware (media) servers will handle the request,
     slice the media content and serve just the fragment
     (corresponding byte ranges)
    … while old ones will serve the whole resource



Influence of Media Formats

 Fragment extraction needs to be expressible in
  terms of byte ranges
 Requirements for the different axes
   temporal: presence of intra-coded frames
    (i.e., random access points)
   spatial: presence of independently coded spatial regions
    track: tracks need to be identifiable by a name

 Conclusion: temporal and track axes are
  realistic, spatial fragments can hardly be
  expressed in terms of byte ranges
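
A small sketch of why the temporal axis maps onto byte ranges once random access points are known. It assumes a hypothetical index of (time, byte offset) pairs, one per random access point; the index values, the file size and the function name snap_to_access_points are invented for illustration and do not describe any particular container format.

```python
from bisect import bisect_left, bisect_right


def snap_to_access_points(start, end, index, total_bytes):
    """Widen [start, end] to random access points and return the matching byte range.

    index is a list of (time_in_seconds, byte_offset) pairs sorted by time.
    """
    times = [time for time, _ in index]
    # Last access point at or before the requested start.
    first = max(bisect_right(times, start) - 1, 0)
    # First access point at or after the requested end, if there is one.
    last = bisect_left(times, end)
    first_byte = index[first][1]
    if last < len(index):
        end_time = index[last][0]
        last_byte = index[last][1] - 1
    else:
        end_time = None  # the fragment runs to the end of the file
        last_byte = total_bytes - 1
    return (index[first][0], end_time), (first_byte, last_byte)


# Hypothetical index: one random access point every 10 seconds.
index = [(0.0, 6000), (10.0, 24000), (20.0, 32001), (30.0, 41000)]
print(snap_to_access_points(11, 19, index, total_bytes=60000))
```

A request for seconds 11 to 19 is widened to the enclosing access points, which is why a server may answer with a slightly different time range than the one requested.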

Can I use Media Fragments now?

<video id='v'
  src='AudioAPI.webm#t=50,100'
  onloadedmetadata='update()'
  onpause='update()'
  onplay='update()'
  onseeked='update()'
  controls></video>

Firefox 9+ (or WebKit)
No bandwidth saving




Can I use Media Fragments now?




Synote over YouTube
No bandwidth saving




Can I use Media Fragments now?



NinSuna
Bandwidth saving




Examining HTTP Traffic

 HTTP request:
   retrieving URI
    parsing key=value pairs from fragment part
   setting Range header

 HTTP response:
    checking Content-Type
     and Content-Range-Mapping header values
   attaching custom playback controls to page
   creating spatial dimension overlay (if specified)



Example: Requesting a Time Fragment

 A web developer specifies a video source with
  a temporal fragment URI:
http://ninsuna.elis.ugent.be/DownloadServlet/mfwg/fragf2f.ogv#t=5,15



 key=value pair is analyzed,
  fragment begin and end time are matched
 t=5,15


 Media Fragments Extension analyses the
  fragment part, retrieves beginning and end time
  and sets proper Range header value:
 Range: t:npt=5-15
Example: Requesting a Time Fragment

 The NinSuna server responds with a 206 Partial Content
  response whose Content-Range-Mapping header shows the
  mapped time range, with the media fragment in the
  message payload:
HTTP/1.1 206 Partial Content
Content-Type: video/ogg
Accept-Ranges: bytes, t, track, id
Content-Range: bytes 629578-1690588/4055466
Content-Range-Mapping: {t:npt 4.8-14.8/0-38.33}={bytes 629578-1690588/4055466}
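
A sketch of how a user agent might read the Content-Range-Mapping value, using the time range from this response and the byte range from its Content-Range header. The regular expression only covers the t:npt-to-bytes shape shown in this deck, and the function name parse_content_range_mapping is hypothetical.

```python
import re

# Matches values shaped like: {t:npt 4.8-14.8/0-38.33}={bytes 629578-1690588/4055466}
MAPPING = re.compile(
    r"\{\s*t:npt\s+(?P<start>[\d.]+)-(?P<end>[\d.]+)/(?P<t0>[\d.]+)-(?P<t1>[\d.]+)"
    r"(?P<setup>;include-setup)?\s*\}\s*=\s*"
    r"\{\s*bytes\s+(?P<ranges>[\d,\-]+)/(?P<total>\d+|\*)\s*\}"
)


def parse_content_range_mapping(value):
    """Split a Content-Range-Mapping value into its time side and its byte side."""
    match = MAPPING.fullmatch(value.strip())
    if match is None:
        raise ValueError(f"unsupported Content-Range-Mapping value: {value!r}")
    byte_ranges = [
        tuple(int(number) for number in byte_range.split("-"))
        for byte_range in match["ranges"].split(",")
    ]
    return {
        "time": (float(match["start"]), float(match["end"])),
        "duration": (float(match["t0"]), float(match["t1"])),
        "include_setup": match["setup"] is not None,
        "bytes": byte_ranges,
        "total": None if match["total"] == "*" else int(match["total"]),
    }


mapping = parse_content_range_mapping(
    "{t:npt 4.8-14.8/0-38.33}={bytes 629578-1690588/4055466}"
)
print(mapping["time"], mapping["bytes"])
```

The mapped time (4.8 to 14.8 seconds) differs from the requested one (5 to 15 seconds), so a player needs the parsed values to position its controls correctly.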



Example: Requesting a Time Fragment

 ... and it won’t work!


 Because the player does not have the media file header




Example: Requesting a Time Fragment

 The player needs to be initialized, so a bare
  fragment is not playable
 To fix this, we add ;include-setup to the Range
  header value:
   Range: t:npt=5-15;include-setup
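
A sketch of the request side, assuming the temporal fragment is written in plain seconds: the fragment is stripped from the URI and re-expressed as a Range header value, optionally with ;include-setup. The function name temporal_range_request is hypothetical and this is not the code of the Media Fragments Extension.

```python
from urllib.parse import urldefrag


def temporal_range_request(uri, include_setup=False):
    """Return (url_without_fragment, headers) for a URI with a '#t=start,end' fragment."""
    url, fragment = urldefrag(uri)
    headers = {}
    for pair in fragment.split("&"):
        name, _, value = pair.partition("=")
        if name != "t":
            continue  # other dimensions are not sent in this sketch
        start, _, end = value.partition(",")
        header = f"t:npt={start or '0'}-{end}"
        if include_setup:
            # Ask the server to also send the bytes needed to initialize the player.
            header += ";include-setup"
        headers["Range"] = header
    return url, headers


uri = "http://ninsuna.elis.ugent.be/DownloadServlet/mfwg/fragf2f.ogv#t=5,15"
print(temporal_range_request(uri))
print(temporal_range_request(uri, include_setup=True))
```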




Example: Requesting a Time Fragment

 The response from the server is slightly different:
HTTP/1.1 206 Partial Content
Content-Type: multipart/byteranges;boundary=End
Content-Range-Mapping: {t:npt 4.8-14.8/0-38.33;include-setup}={bytes 0-5997,629578-1690588/4055466}
...

--End
Content-Type: video/ogg
Content-Range: bytes 0-5997/4055466

{binary data}
--End
Content-Type: video/ogg
Content-Range: bytes 629578-1690588/4055466

{binary data}
--End--


Example: Requesting a Time Fragment

 This is why we need to attach a Stream Listener
  to the HTTP channel:
Multipart response received on the HTTP channel:

HTTP/1.1 206 Partial Content
Content-Type: multipart/byteranges;boundary=End
...

--End
Content-Type: video/ogg
Content-Range: bytes 0-5997/4055466

{binary data}
--End
Content-Type: video/ogg
Content-Range: bytes 629578-1690588/4055466

{binary data}
--End--

Single response handed to the player:

HTTP/1.1 206 Partial Content
Content-Type: video/ogg
...

{binary data}
{binary data}
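
A sketch of the transformation such a listener performs, written in Python for illustration only (the actual extension runs inside the browser): the multipart/byteranges body is split on its boundary and the part payloads are concatenated, so the player sees setup data followed by the fragment. The function name merge_byteranges and the placeholder payload bytes are assumptions.

```python
def merge_byteranges(body, boundary):
    """Concatenate the payloads of a multipart/byteranges body, in order."""
    delimiter = b"--" + boundary.encode("ascii")
    payloads = []
    for part in body.split(delimiter)[1:]:
        if part.startswith(b"--"):
            break  # closing delimiter, e.g. "--End--"
        _headers, separator, payload = part.partition(b"\r\n\r\n")
        if not separator:
            continue  # no blank line between headers and payload: not a real part
        # Drop the CRLF that precedes the next delimiter.
        if payload.endswith(b"\r\n"):
            payload = payload[:-2]
        payloads.append(payload)
    return b"".join(payloads)


# Placeholder payloads stand in for the real binary data.
body = (
    b"--End\r\n"
    b"Content-Type: video/ogg\r\n"
    b"Content-Range: bytes 0-5997/4055466\r\n"
    b"\r\n"
    b"SETUP"
    b"\r\n--End\r\n"
    b"Content-Type: video/ogg\r\n"
    b"Content-Range: bytes 629578-1690588/4055466\r\n"
    b"\r\n"
    b"FRAMES"
    b"\r\n--End--\r\n"
)
print(merge_byteranges(body, "End"))
```

A real listener would work on the stream incrementally and would also rewrite the Content-Type of the response from multipart/byteranges to the media type of the parts.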

Spatial Fragments

 No additional information is sent to the server
 The overlay is created by appending four additional
  DIVs to the webpage
 They are styled to create the impression of a
  semi-opaque layer over the video element
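
One plausible way to lay out the four overlay boxes, sketched as geometry only. The slide does not say how the DIVs are arranged, so the top/bottom/left/right split, the function name overlay_rectangles and the example dimensions are all assumptions.

```python
def overlay_rectangles(video_w, video_h, x, y, w, h):
    """Return four (left, top, width, height) boxes covering everything but the region."""
    # Clamp the highlighted region to the video area.
    x0, y0 = max(0, x), max(0, y)
    x1, y1 = min(video_w, x + w), min(video_h, y + h)
    return {
        "top": (0, 0, video_w, y0),
        "bottom": (0, y1, video_w, video_h - y1),
        "left": (0, y0, x0, y1 - y0),
        "right": (x1, y0, video_w - x1, y1 - y0),
    }


# A 640x360 video with a hypothetical region 160,90,320,180 (x, y, width, height).
for name, box in overlay_rectangles(640, 360, 160, 90, 320, 180).items():
    print(name, box)
```

Each box would then be positioned absolutely over the video element and given a semi-transparent background, leaving the highlighted region uncovered.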




NinSuna: Overview

 Fully integrated media adaptation and delivery
  platform
   media adaptation and media packaging core is
    independent of media formats
        based on model-driven content adaptation & delivery technique
   only high-level adaptation operations such as scene
    selection and frame rate scaling
        no transcoding is applied
        ideal candidate for MF implementation
   more information: http://ninsuna.elis.ugent.be




MF-specific Features of NinSuna

 Support for Media Fragment URI queries
   both HTTP and RTSP implementation
   try some URIs at
        http://ninsuna.elis.ugent.be/MediaFragmentsServer#Test
        http://ninsuna.elis.ugent.be/DownloadServlet/apple/10,000_BC_trailer_2.mp4?track=5;6

 Support for Media Fragment URI fragments
   i.e., support for the MF-specific HTTP headers

 Combining Media Fragment URI queries and
  fragments
   e.g., http://foo.com/media.mp4?t=10,40#t=5,10

Impact on the Existing Web Infrastructure

 Updating existing infrastructure is not trivial
   web servers need to be extended with
         media extractors
         MF-specific HTTP communication
   existing web caches are only effective with HTTP byte
    ranges
         specialized media caches need to be developed in the future to
          cache the other cases
   user agents need to be extended with
         a Media Fragment URI parser and interpreter
         MF-specific HTTP communication
         MF-specific visualization


Media Fragments Proxy

 Goal: make existing media resources, served
  by generic HTTP Web servers, available as
  Media Fragments
 Biggest problem for content providers
  willing to support Media Fragments
    media extractor (dependent on underlying media
     formats)
 Solution: an approach that
    works with existing HTTP Web servers (Apache, IIS)
    works with existing Web caches
    works with not-so-smart user agents

Media Fragments Proxy

The MF User Agent resolves http://foo.com/video.ogv#t=11,19

1. Request from the MF User Agent to the MF Proxy:

GET /?url=http://foo.com/video.ogv HTTP/1.1
Host: MFProxy.com
Accept: video/*
Range: t:npt=11-19
Accept-Range-Redirect: bytes

2. The MF Proxy, talking to the HTTP server:
   - get (only) the header info
   - try to find a fragment-to-byte mapping
   - construct the redirect response

3. Response from the MF Proxy to the MF User Agent:

HTTP/1.1 307 Temporary Redirect
Location: http://foo.com/video.ogv
Accept-Ranges: bytes, t, track
Content-Length: 0
Content-Type: video/ogg
Content-Range-Mapping: t:npt 10-20/0-50
Range-Redirect: 24000-32000
Vary: Accept-Range-Redirect

4. Request from the MF User Agent to the HTTP server:

GET /video.ogv HTTP/1.1
Host: foo.com
Accept: video/*
Range: bytes=24000-32000

http://ninsuna.elis.ugent.be/MFProxy?url=<mediaURI>
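
A sketch of the two header manipulations in this exchange, using the header names and values shown on the slide. The function names build_range_redirect and follow_range_redirect are hypothetical, and the mapping itself (times and byte offsets) is taken as given rather than computed.

```python
def build_range_redirect(media_url, content_type, mapped_time, duration, byte_range):
    """Headers a proxy could send with '307 Temporary Redirect' once it has a mapping."""
    start, end = mapped_time
    total_start, total_end = duration
    first_byte, last_byte = byte_range
    return {
        "Location": media_url,
        "Accept-Ranges": "bytes, t, track",
        "Content-Length": "0",
        "Content-Type": content_type,
        "Content-Range-Mapping": f"t:npt {start:g}-{end:g}/{total_start:g}-{total_end:g}",
        "Range-Redirect": f"{first_byte}-{last_byte}",
        "Vary": "Accept-Range-Redirect",
    }


def follow_range_redirect(response_headers):
    """Turn the redirect into the plain byte-range request sent to the origin server."""
    return (
        response_headers["Location"],
        {"Range": "bytes=" + response_headers["Range-Redirect"]},
    )


redirect = build_range_redirect(
    "http://foo.com/video.ogv", "video/ogg", (10, 20), (0, 50), (24000, 32000)
)
print(follow_range_redirect(redirect))
```

Because the second request is an ordinary byte-range request, it can be answered by an unmodified Web server and stored by existing Web caches.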

Annotating Media Fragments

yuma.min.js




Annotating Media Fragments

Benchmarking: Sphinx, HTK, Julius
NER + full text index with the transcription
Interlinking with the Linked Data Cloud to enable semantic search




http://www.slideshare.net/troncy

 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 

Addressing and Annotating Multimedia Fragments

  • 1. Addressing and annotating multimedia fragments Raphaël Troncy <raphael.troncy@eurecom.fr>
  • 2. Use Case 10/04/2012 - Séminaire Muséologie - Le Web devient audiovisuel -2
  • 3. Use Case
      • Aidem received on her Facebook wall a status message containing a Media Fragment URI
          • Use a ‘#’!
          • Highlight a video sequence
          • Highlight a region to pay attention to
  • 4. What are Media Fragments? [Figure labels: time axis t with marks 0, 20 and 35; temporal media fragment; spatial media fragment; track media fragment]
  • 5. Temporal Media Fragments [Figure labels: original resource length; fragment beginning; playback progress; fragment end]
  • 6. Spatial Media Fragments [Figure labels: highlighted fragment; semi-opaque overlay]
  • 7. Media Fragments Use Cases
      • Bookmark / share parts (fragments) of audio/video content
      • Annotate media fragments
      • Mash-ups
      • Conserve bandwidth
      http://www.w3.org/TR/media-frags-reqs/
  • 8. Requirements
      • r01: Temporal fragments:
          • a clipping along the time dimension from a start to an end time that are within the duration of the media resource
      • r02: Spatial fragments:
          • a clipping of an image region; only rectangular regions are considered
      • r03: Track fragments:
          • a track as exposed by a container format of the media resource
      • r04: Named fragments:
          • a temporal media fragment that has been given a name through some sort of annotation mechanism
  • 9. Media URIs
      • Using the URI query part: http://www.example.org/video.ogv?t=60,100
      • Using the URI fragment part: http://www.example.org/video.ogv#t=60,100
      • Mixing both: http://www.example.org/video.ogv?t=60,100#t=10,15
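The difference between the query part and the fragment part can be made concrete with a small sketch. It uses only Python's standard `urllib.parse`; the helper name `media_dimensions` is hypothetical and not part of any Media Fragments library.

```python
from urllib.parse import urlsplit, parse_qsl


def media_dimensions(uri):
    """Split a media URI into its query dimensions and its fragment dimensions.

    Both parts use the same name=value syntax, so the same parser is applied
    to each. Values are returned as raw strings (for example '60,100').
    """
    parts = urlsplit(uri)
    query = dict(parse_qsl(parts.query, keep_blank_values=True))
    fragment = dict(parse_qsl(parts.fragment, keep_blank_values=True))
    return query, fragment


query, fragment = media_dimensions("http://www.example.org/video.ogv?t=60,100#t=10,15")
print(query)     # {'t': '60,100'}
print(fragment)  # {'t': '10,15'}
```

Only the query part travels to the server in the request line; the fragment part stays with the user agent, which is why it needs separate handling.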
  • 10. URI Fragments vs. URI Queries
        #t=20,30 (URI fragment)                                   | ?t=20,30 (URI query)
        secondary resource, notion of context                     | primary resource, no notion of context
        extraction needs to be expressible in byte ranges         | no adaptation restrictions
        no provisions for communicating fragments to the server   | key-value pairs are sent to the server
        potentially cacheable                                     | not cacheable
      • The media fragment URI syntax can be used for URI queries
      • We will focus on URI fragments
  • 11. Media Fragments Resolution
      • For the URI query part:
          • The media file is only processed on the server side
          • The UA receives a new video file
      • For the URI fragment part:
          • A smart UA may strip out the fragment definition and encode it into custom HTTP headers (Range header)
          • (Media) servers will handle the request, slice the media content and serve just the fragment (corresponding byte ranges) … while old ones will serve the whole resource
  • 12. Influence of Media Formats
      • Fragment extraction needs to be expressible in terms of byte ranges
      • Requirements for the different axes
          • temporal: presence of intra-coded frames (i.e., random access points)
          • spatial: presence of independently coded spatial regions
          • track: needs to be identifiable by a name
      • Conclusion: temporal and track axes are realistic; spatial fragments can hardly be expressed in terms of byte ranges
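To illustrate why random access points matter on the temporal axis, here is a minimal sketch of snapping a requested time range to the nearest preceding access points. The keyframe timestamps are invented for the illustration, and the policy of moving both ends back to a preceding keyframe is an assumption, not a description of how any particular server works.

```python
from bisect import bisect_right


def snap_to_keyframes(start, end, keyframes):
    """Move start and end back to the last keyframe at or before each of them.

    `keyframes` is a sorted list of random access point times in seconds.
    A server can only cut cleanly at these points, so the served range may
    differ slightly from the requested one.
    """
    def floor_keyframe(t):
        i = bisect_right(keyframes, t)
        return keyframes[i - 1] if i else keyframes[0]

    return floor_keyframe(start), floor_keyframe(end)


# Hypothetical access points, roughly every 2.5 seconds.
keyframes = [0.0, 2.3, 4.8, 7.3, 9.8, 12.3, 14.8, 17.3]
print(snap_to_keyframes(5, 15, keyframes))  # (4.8, 14.8)
```

With these made-up keyframes, a request for seconds 5 to 15 is served as 4.8 to 14.8, so a client should read the mapped range from the response rather than assume it got exactly what it asked for.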
  • 13. Can I use Media Fragments now?
        <video id='v' src='AudioAPI.webm#t=50,100' onloadedmetadata='update()' onpause='update()' onplay='update()' onseeked='update()' controls></video>
      • Firefox 9+ (or WebKit)
      • No bandwidth saving
  • 14. Can I use Media Fragments now?
      • Sysnote over YouTube
      • No bandwidth saving
  • 15. Can I use Media Fragments now?
      • NinSuna
      • Bandwidth saving
  • 16. Examining HTTP Traffic
      • HTTP request:
          • retrieving the URI
          • parsing key=value pairs from the fragment part
          • setting the Range header
      • HTTP response:
          • checking the Content-Type and Content-Range-Mapping header values
          • attaching custom playback controls to the page
          • creating the spatial dimension overlay (if specified)
  • 17. Example: Requesting a Time Fragment
      • A web developer specifies a video source with a temporal fragment URI:
            http://ninsuna.elis.ugent.be/DownloadServlet/mfwg/fragf2f.ogv#t=5,15
      • The key=value pair is analyzed; the fragment begin and end times are matched:
            t=5,15
      • The Media Fragments Extension analyses the fragment part, retrieves the beginning and end time and sets the proper Range header value:
            Range: t:npt=5-15
  • 18. Example: Requesting a Time Fragment
      • The NinSuna server responds with a 206 Partial Content response, a Content-Range-Mapping header showing the mapped time ranges, and the media fragment in the message payload:
            HTTP/1.1 206 Partial Content
            Content-Type: video/ogg
            Accept-Ranges: bytes, t, track, id
            Content-Range: bytes 629578-1690588/4055466
            Content-Range-Mapping: {t:npt 4.8-14.8/0-38.33}={bytes 629578-1690588/4055466}
  • 19. Example: Requesting a Time Fragment
      • ... and it won’t work!
      • Because the player does not have the media file header
  • 20. Example: Requesting a Time Fragment
      • The player needs to be initialized, so a bare fragment is not playable
      • To fix this, we add ;include-setup to the Range header value:
            Range: t:npt=5-15;include-setup
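The translation from a temporal fragment to the Range header values shown in slides 17 and 20 can be sketched as follows. The function name `temporal_range_header` is hypothetical, and the handling of a missing start time (treated as 0) is an assumption made for the sketch; only the `start,end` case is taken from the slides.

```python
from urllib.parse import urlsplit, parse_qsl


def temporal_range_header(uri, include_setup=False):
    """Build a Range header value such as 't:npt=5-15' from a '#t=5,15' fragment.

    Returns None when the URI carries no temporal dimension.
    """
    dims = dict(parse_qsl(urlsplit(uri).fragment))
    t = dims.get("t")
    if t is None:
        return None
    if t.startswith("npt:"):          # optional explicit time format prefix
        t = t[len("npt:"):]
    start, _, end = t.partition(",")  # 't=5,15' -> ('5', ',', '15')
    value = f"t:npt={start or 0}-{end}"
    if include_setup:
        # Ask the server to also send the media header needed to start the decoder.
        value += ";include-setup"
    return value


uri = "http://ninsuna.elis.ugent.be/DownloadServlet/mfwg/fragf2f.ogv#t=5,15"
print(temporal_range_header(uri))                      # t:npt=5-15
print(temporal_range_header(uri, include_setup=True))  # t:npt=5-15;include-setup
```

The fragment itself is never sent in the request line, so the header is the only channel through which the server learns which time range is wanted.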
  • 21. Example: Requesting a Time Fragment
      • The response from the server is slightly different:
            HTTP/1.1 206 Partial Content
            Content-Type: multipart/byteranges;boundary=End
            Content-Range-Mapping: {t:npt 4.8-14.8/0-38.33;include-setup}={bytes 0-5997,629578-1690588/4055466}
            ...
            --End
            Content-Type: video/ogg
            Content-Range: bytes 0-5997/4055466
            {binary data}
            --End
            Content-Type: video/ogg
            Content-Range: bytes 629578-1690588/4055466
            {binary data}
            --End
            --End--
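A user agent has to read the Content-Range-Mapping header to learn which time range it actually received. The sketch below parses a header value shaped like the ones in slides 18 and 21; the regular expression is inferred from those examples only and is not a complete grammar for the header, and the function name `parse_range_mapping` is hypothetical.

```python
import re

# Pattern inferred from the example headers; not a full grammar.
MAPPING = re.compile(
    r"\{\s*t:npt\s+(?P<start>[\d.]+)-(?P<end>[\d.]+)"
    r"/(?P<first>[\d.]+)-(?P<total>[\d.]+)"
    r"(?P<setup>;include-setup)?\s*\}\s*=\s*"
    r"\{\s*bytes\s+(?P<ranges>[\d,\-]+)/(?P<size>\d+)\s*\}"
)


def parse_range_mapping(value):
    """Return the mapped time range, resource duration and byte ranges."""
    m = MAPPING.fullmatch(value.strip())
    if m is None:
        raise ValueError(f"unrecognized Content-Range-Mapping: {value!r}")
    byte_ranges = [
        tuple(int(n) for n in r.split("-")) for r in m["ranges"].split(",")
    ]
    return {
        "time": (float(m["start"]), float(m["end"])),
        "duration": (float(m["first"]), float(m["total"])),
        "include_setup": m["setup"] is not None,
        "bytes": byte_ranges,
        "size": int(m["size"]),
    }


parsed = parse_range_mapping(
    "{t:npt 4.8-14.8/0-38.33}={bytes 629578-1690588/4055466}"
)
print(parsed["time"])   # (4.8, 14.8)
print(parsed["bytes"])  # [(629578, 1690588)]
```

When `;include-setup` is present, the byte list holds more than one range, which matches the multipart response: one part for the file header and one for the fragment itself.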
  • 22. Example: Requesting a Time Fragment
      • This is why we need to attach a Stream Listener to the HTTP channel:
        Response as sent by the server:
            HTTP/1.1 206 Partial Content
            Content-Type: multipart/byteranges;boundary=End
            ...
            --End
            Content-Type: video/ogg
            Content-Range: bytes 0-5997/4055466
            {binary data}
            --End
            Content-Type: video/ogg
            Content-Range: bytes 629578-1690588/4055466
            {binary data}
            --End
            --End--
        Response as handed to the player:
            HTTP/1.1 206 Partial Content
            Content-Type: video/ogg
            ...
            {binary data}
            {binary data}
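What the stream listener has to do can be approximated in a few lines: strip the multipart framing and concatenate the payloads so the player sees one continuous video/ogg stream. This is a simplified sketch, not the extension's actual code; it assumes the whole body is in memory, that lines end in CRLF, and that the boundary string never occurs inside the binary data.

```python
def join_byteranges(body, boundary):
    """Concatenate the payloads of a multipart/byteranges body.

    `body` is the raw response body as bytes; `boundary` is the value of the
    boundary parameter from the Content-Type header (for example 'End').
    """
    delimiter = b"\r\n--" + boundary.encode("ascii")
    payloads = []
    # Prefix a CRLF so the very first delimiter matches like all the others.
    for segment in (b"\r\n" + body).split(delimiter)[1:]:
        if segment.startswith(b"--"):
            break                      # closing delimiter '--End--'
        _headers, sep, payload = segment.partition(b"\r\n\r\n")
        if not sep:
            continue                   # empty part without headers or payload
        payloads.append(payload)
    return b"".join(payloads)


# Tiny stand-in body: 'HEAD' plays the role of the setup data, 'DATA' the fragment.
body = (
    b"--End\r\nContent-Type: video/ogg\r\nContent-Range: bytes 0-3/10\r\n\r\nHEAD"
    b"\r\n--End\r\nContent-Type: video/ogg\r\nContent-Range: bytes 6-9/10\r\n\r\nDATA"
    b"\r\n--End--\r\n"
)
print(join_byteranges(body, "End"))  # b'HEADDATA'
```

A production listener would work incrementally on the incoming stream instead of buffering it, but the transformation is the same.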
  • 23. Spatial Fragments
      • No additional information is sent to the server
      • The overlay is created by appending four additional DIVs to the webpage
      • They are styled to create the impression of a semi-opaque layer over the video element
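The geometry behind a four-DIV overlay can be sketched as below: four rectangles that cover everything except the highlighted region. How the real extension arranges its DIVs is not stated in the slides, so this top/bottom/left/right split is an assumption, and the function name `overlay_rects` is hypothetical. Coordinates are pixels relative to the top-left corner of the video element.

```python
def overlay_rects(video_w, video_h, x, y, w, h):
    """Return four (left, top, width, height) boxes surrounding the region.

    Together the boxes cover the whole video area except the rectangle
    described by x, y, w, h (as in a '#xywh=x,y,w,h' fragment).
    """
    return {
        "top":    (0, 0, video_w, y),
        "bottom": (0, y + h, video_w, video_h - (y + h)),
        "left":   (0, y, x, h),
        "right":  (x + w, y, video_w - (x + w), h),
    }


# A 640x480 video with the fragment #xywh=160,120,320,240
for name, box in overlay_rects(640, 480, 160, 120, 320, 240).items():
    print(name, box)
# top (0, 0, 640, 120)
# bottom (0, 360, 640, 120)
# left (0, 120, 160, 240)
# right (480, 120, 160, 240)
```

Each box would then become one absolutely positioned DIV with a semi-opaque background, leaving the region itself uncovered.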
  • 24. NinSuna: Overview
      • Fully integrated media adaptation and delivery platform
          • media adaptation and media packaging core is independent of media formats
              • based on a model-driven content adaptation & delivery technique
          • only high-level adaptation operations such as scene selection and frame rate scaling
              • no transcoding is applied
              • ideal candidate for MF implementation
          • more information: http://ninsuna.elis.ugent.be
  • 25. MF-specific Features of NinSuna
      • Support for Media Fragment URI queries
          • both HTTP and RTSP implementation
          • try some URIs at http://ninsuna.elis.ugent.be/MediaFragmentsServer#Test
            http://ninsuna.elis.ugent.be/DownloadServlet/apple/10,000_BC_trailer_2.mp4?track=5;6
      • Support for Media Fragment URI fragments
          • i.e., support for the MF-specific HTTP headers
      • Combining Media Fragment URI queries and fragments
          • e.g., http://foo.com/media.mp4?t=10,40#t=5,10
  • 26. Impact on the Existing Web Infrastructure
      • Updating existing infrastructure is not trivial
          • web servers need to be extended with
              • media extractors
              • MF-specific HTTP communication
          • existing web caches are only effective with HTTP byte ranges
              • specialized media caches need to be developed in the future to cache the other cases
          • user agents need to be extended with
              • a Media Fragment URI parser and interpreter
              • MF-specific HTTP communication
              • MF-specific visualization
  • 27. Media Fragments Proxy
      • Goal: make existing media resources, served by generic HTTP Web servers, available as Media Fragments
      • Biggest problem for content providers willing to support Media Fragments
          • media extractor (dependent on underlying media formats)
      • Solution: an approach that
          • works with existing HTTP Web servers (Apache, IIS)
          • works with existing Web caches
          • works with not so smart user agents
  • 28. Media Fragments Proxy
      • The MF User Agent resolves http://foo.com/video.ogv#t=11,19 by asking the MF Proxy:
            GET /?url=http://foo.com/video.ogv HTTP/1.1
            Host: MFProxy.com
            Accept: video/*
            Range: t:npt=11-19
            Accept-Range-Redirect: bytes
      • The MF Proxy contacts the HTTP server:
          • get (only) the header info
          • try to find a fragment-to-byte mapping
          • construct the redirect response
      • The MF Proxy answers the MF User Agent:
            HTTP/1.1 307 Temporary Redirect
            Location: http://foo.com/video.ogv
            Accept-Ranges: bytes, t, track
            Content-Length: 0
            Content-Type: video/ogg
            Content-Range-Mapping: t:npt 10-20/0-50
            Range-Redirect: 24000-32000
            Vary: Accept-Range-Redirect
      • The MF User Agent then requests the bytes from the HTTP server:
            GET /video.ogv HTTP/1.1
            Host: foo.com
            Accept: video/*
            Range: bytes=24000-32000
      http://ninsuna.elis.ugent.be/MFProxy?url=<mediaURI>
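The redirect exchange above can be sketched from both sides. The first function builds the headers a proxy would return once it has found a fragment-to-byte mapping; the second shows the follow-up request a user agent would derive from them. Both function names are hypothetical, and the mapping values are simply the ones from the slide; how the proxy computes them from the media header is not covered here.

```python
from urllib.parse import urlsplit


def redirect_headers(media_url, content_type, mapped_time, duration, byte_range):
    """Headers of the 307 response sent by the proxy (values as in the slide)."""
    start, end = mapped_time
    first, last = byte_range
    return {
        "Location": media_url,
        "Accept-Ranges": "bytes, t, track",
        "Content-Length": "0",
        "Content-Type": content_type,
        "Content-Range-Mapping": f"t:npt {start}-{end}/0-{duration}",
        "Range-Redirect": f"{first}-{last}",
        "Vary": "Accept-Range-Redirect",
    }


def follow_up_request(response_headers):
    """Plain byte-range request the user agent sends to the original server."""
    target = urlsplit(response_headers["Location"])
    return {
        "request_line": f"GET {target.path} HTTP/1.1",
        "Host": target.netloc,
        "Accept": "video/*",
        "Range": f"bytes={response_headers['Range-Redirect']}",
    }


headers = redirect_headers(
    "http://foo.com/video.ogv", "video/ogg", (10, 20), 50, (24000, 32000)
)
request = follow_up_request(headers)
print(request["request_line"])  # GET /video.ogv HTTP/1.1
print(request["Range"])         # bytes=24000-32000
```

Because the second request is an ordinary byte-range request, the origin server and any caches in between need no knowledge of Media Fragments.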
  • 29. Annotating Media Fragments yuma.min.js
  • 30. Annotating Media Fragments
      • Benchmarking: Sphinx, HTK, Julius
      • NER + full text index with the transcription
      • Interlinking with the Linked Data Cloud to enable semantic search
  • 32. http://www.slideshare.net/troncy