SlideShare a Scribd company logo
1 of 55
Making the most of
structured content:
data products from OpenLearn XML

                 Tony Hirst
      Dept of Communication and Systems,
              The Open University
So how do we get the links?
OU XML
BUT…
…the index is typically too small to be
                      of much interest
So what else
might we be
 able to do?
OU-XML lets
     you treat SA*
     documents as
      databases…
* Structured authoring
If the documents are
  just datasets, what
secondary or derived
    products can we
produce from them?
Automatic
   outline
 extraction:
mindmap view
Parent Unit   Unit code   Section   Section Heading
Coursewide
directories:
  learning
 outcomes
Course   Learning Outcome
XYX123   Have an understanding of P
XYZ123   Be able to do Q
ABC123   Have an understanding of R
Grab a list of course units from OpenLearn
For each unit:
  - Generate the URL for the XML version of
  each unit
  - Grab the XML for the unit
  - Extract learning outcomes, image locations,
  glossary items and link information
Coursewide
 directories:
 OpenLearn
image finder
Unit   Image Location   Caption   Description/alt text   Acknowledgements
Coursewide
 directories:
 OpenLearn
meta-glossary
Unit   Glossary Item   Description
Dynamic
   outline
 generation:
mindmap view
- unit
- unitset



- Learning outcome

- Glossary item

- figure
DiscOU
OU KMi project
   (d’Aquin et al)
Search Engine Text Adventures
What’s
missing…?
… search-based
   pedagogy…?
blog.ouseful.info

@psychemedia

More Related Content

Similar to Ili structuredauthoring

Searching Heterogenous E Learning Resources
Searching Heterogenous E Learning ResourcesSearching Heterogenous E Learning Resources
Searching Heterogenous E Learning Resourcesimranlatif
 
Apache Solr, il motore di ricerca enterprise open source
Apache Solr, il motore di ricerca enterprise open sourceApache Solr, il motore di ricerca enterprise open source
Apache Solr, il motore di ricerca enterprise open sourceLuca Bonesini
 
Search Me: Using Lucene.Net
Search Me: Using Lucene.NetSearch Me: Using Lucene.Net
Search Me: Using Lucene.Netgramana
 
Object And Oriented Programing ( Oop ) Languages
Object And Oriented Programing ( Oop ) LanguagesObject And Oriented Programing ( Oop ) Languages
Object And Oriented Programing ( Oop ) LanguagesJessica Deakin
 
CPP_,module2_1.pptx
CPP_,module2_1.pptxCPP_,module2_1.pptx
CPP_,module2_1.pptxAbhilashTom4
 
C++ plus data structures, 3rd edition (2003)
C++ plus data structures, 3rd edition (2003)C++ plus data structures, 3rd edition (2003)
C++ plus data structures, 3rd edition (2003)SHC
 
LUCERO - Building the Open University's Web of Linked Data
LUCERO - Building the Open University's Web of Linked DataLUCERO - Building the Open University's Web of Linked Data
LUCERO - Building the Open University's Web of Linked DataMathieu d'Aquin
 
LUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked DataLUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked DataMathieu d'Aquin
 
OpenURL Linking: the Academic Library Experience
OpenURL Linking: the Academic Library ExperienceOpenURL Linking: the Academic Library Experience
OpenURL Linking: the Academic Library ExperienceJohn McDonald
 
SHOE (simple html ontology extensions)
SHOE (simple html ontology extensions)SHOE (simple html ontology extensions)
SHOE (simple html ontology extensions)Selman Bozkır
 
LODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU seriesLODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU seriesCarsten Keßler
 
DB-IR-ranking
DB-IR-rankingDB-IR-ranking
DB-IR-rankingFELIX75
 
UMBEL Semantic Web Services
UMBEL Semantic Web ServicesUMBEL Semantic Web Services
UMBEL Semantic Web ServicesMike Bergman
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so farEnrico Daga
 
2015.05.19 tom de nies - tin can2prov exposing interoperable provenance of ...
2015.05.19   tom de nies - tin can2prov exposing interoperable provenance of ...2015.05.19   tom de nies - tin can2prov exposing interoperable provenance of ...
2015.05.19 tom de nies - tin can2prov exposing interoperable provenance of ...tdenies
 

Similar to Ili structuredauthoring (20)

Searching Heterogenous E Learning Resources
Searching Heterogenous E Learning ResourcesSearching Heterogenous E Learning Resources
Searching Heterogenous E Learning Resources
 
Apache Solr, il motore di ricerca enterprise open source
Apache Solr, il motore di ricerca enterprise open sourceApache Solr, il motore di ricerca enterprise open source
Apache Solr, il motore di ricerca enterprise open source
 
Search Me: Using Lucene.Net
Search Me: Using Lucene.NetSearch Me: Using Lucene.Net
Search Me: Using Lucene.Net
 
Object And Oriented Programing ( Oop ) Languages
Object And Oriented Programing ( Oop ) LanguagesObject And Oriented Programing ( Oop ) Languages
Object And Oriented Programing ( Oop ) Languages
 
130704798265658191
130704798265658191130704798265658191
130704798265658191
 
CPP_,module2_1.pptx
CPP_,module2_1.pptxCPP_,module2_1.pptx
CPP_,module2_1.pptx
 
C++ plus data structures, 3rd edition (2003)
C++ plus data structures, 3rd edition (2003)C++ plus data structures, 3rd edition (2003)
C++ plus data structures, 3rd edition (2003)
 
Segmentation
SegmentationSegmentation
Segmentation
 
CS3391 -OOP -UNIT – I NOTES FINAL.pdf
CS3391 -OOP -UNIT – I  NOTES FINAL.pdfCS3391 -OOP -UNIT – I  NOTES FINAL.pdf
CS3391 -OOP -UNIT – I NOTES FINAL.pdf
 
LUCERO - Building the Open University's Web of Linked Data
LUCERO - Building the Open University's Web of Linked DataLUCERO - Building the Open University's Web of Linked Data
LUCERO - Building the Open University's Web of Linked Data
 
LUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked DataLUCERO - Building the Open University Web of Linked Data
LUCERO - Building the Open University Web of Linked Data
 
OpenURL Linking: the Academic Library Experience
OpenURL Linking: the Academic Library ExperienceOpenURL Linking: the Academic Library Experience
OpenURL Linking: the Academic Library Experience
 
SHOE (simple html ontology extensions)
SHOE (simple html ontology extensions)SHOE (simple html ontology extensions)
SHOE (simple html ontology extensions)
 
LODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU seriesLODUM talk at ifgi's Spatial @ WWU series
LODUM talk at ifgi's Spatial @ WWU series
 
DB-IR-ranking
DB-IR-rankingDB-IR-ranking
DB-IR-ranking
 
Freeware For Research V2 Mar 09
Freeware For Research V2 Mar 09Freeware For Research V2 Mar 09
Freeware For Research V2 Mar 09
 
UMBEL Semantic Web Services
UMBEL Semantic Web ServicesUMBEL Semantic Web Services
UMBEL Semantic Web Services
 
Ad507
Ad507Ad507
Ad507
 
Linked Data at the OU - the story so far
Linked Data at the OU - the story so farLinked Data at the OU - the story so far
Linked Data at the OU - the story so far
 
2015.05.19 tom de nies - tin can2prov exposing interoperable provenance of ...
2015.05.19   tom de nies - tin can2prov exposing interoperable provenance of ...2015.05.19   tom de nies - tin can2prov exposing interoperable provenance of ...
2015.05.19 tom de nies - tin can2prov exposing interoperable provenance of ...
 

More from Tony Hirst

15 in 20 research fiesta
15 in 20 research fiesta15 in 20 research fiesta
15 in 20 research fiestaTony Hirst
 
Jupyternotebooks ou.pptx
Jupyternotebooks ou.pptxJupyternotebooks ou.pptx
Jupyternotebooks ou.pptxTony Hirst
 
Virtual computing.pptx
Virtual computing.pptxVirtual computing.pptx
Virtual computing.pptxTony Hirst
 
ouseful-parlihacks
ouseful-parlihacksouseful-parlihacks
ouseful-parlihacksTony Hirst
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriateTony Hirst
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriateTony Hirst
 
Robotlab jupyter
Robotlab   jupyterRobotlab   jupyter
Robotlab jupyterTony Hirst
 
Fco open data in half day th-v2
Fco open data in half day  th-v2Fco open data in half day  th-v2
Fco open data in half day th-v2Tony Hirst
 
Notes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopNotes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopTony Hirst
 
Community Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireCommunity Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireTony Hirst
 
Residential school 2015_robotics_interest
Residential school 2015_robotics_interestResidential school 2015_robotics_interest
Residential school 2015_robotics_interestTony Hirst
 
Data Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXData Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXTony Hirst
 
A Quick Tour of OpenRefine
A Quick Tour of OpenRefineA Quick Tour of OpenRefine
A Quick Tour of OpenRefineTony Hirst
 
Conversations with data
Conversations with dataConversations with data
Conversations with dataTony Hirst
 
Data reuse OU workshop bingo
Data reuse OU workshop bingoData reuse OU workshop bingo
Data reuse OU workshop bingoTony Hirst
 
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Tony Hirst
 
Lincoln jun14datajournalism
Lincoln jun14datajournalismLincoln jun14datajournalism
Lincoln jun14datajournalismTony Hirst
 

More from Tony Hirst (20)

15 in 20 research fiesta
15 in 20 research fiesta15 in 20 research fiesta
15 in 20 research fiesta
 
Dev8d jupyter
Dev8d jupyterDev8d jupyter
Dev8d jupyter
 
Ili 16 robot
Ili 16 robotIli 16 robot
Ili 16 robot
 
Jupyternotebooks ou.pptx
Jupyternotebooks ou.pptxJupyternotebooks ou.pptx
Jupyternotebooks ou.pptx
 
Virtual computing.pptx
Virtual computing.pptxVirtual computing.pptx
Virtual computing.pptx
 
ouseful-parlihacks
ouseful-parlihacksouseful-parlihacks
ouseful-parlihacks
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriate
 
Gors appropriate
Gors appropriateGors appropriate
Gors appropriate
 
Robotlab jupyter
Robotlab   jupyterRobotlab   jupyter
Robotlab jupyter
 
Fco open data in half day th-v2
Fco open data in half day  th-v2Fco open data in half day  th-v2
Fco open data in half day th-v2
 
Notes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 WorkshopNotes on the Future - ILI2015 Workshop
Notes on the Future - ILI2015 Workshop
 
Community Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wireCommunity Journalism Conf - hyperlocal data wire
Community Journalism Conf - hyperlocal data wire
 
Residential school 2015_robotics_interest
Residential school 2015_robotics_interestResidential school 2015_robotics_interest
Residential school 2015_robotics_interest
 
Data Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKXData Mining - Separating Fact From Fiction - NetIKX
Data Mining - Separating Fact From Fiction - NetIKX
 
Week4
Week4Week4
Week4
 
A Quick Tour of OpenRefine
A Quick Tour of OpenRefineA Quick Tour of OpenRefine
A Quick Tour of OpenRefine
 
Conversations with data
Conversations with dataConversations with data
Conversations with data
 
Data reuse OU workshop bingo
Data reuse OU workshop bingoData reuse OU workshop bingo
Data reuse OU workshop bingo
 
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
 
Lincoln jun14datajournalism
Lincoln jun14datajournalismLincoln jun14datajournalism
Lincoln jun14datajournalism
 

Ili structuredauthoring

Editor's Notes

  1. [ABSTRACT: the Open University’s open educational resources are derived from course units that are authored as structured XML documents on the OpenLearn site. Tony Hirst shows how new products can be derived from document archives, if we think of them as ‘data’.]The current OU workflow results in the production of structured XML documents that can be used to generate several different "output" document formats, from HTML documents for use in the VLE to eBooks and PDFs. But XML documents can also be viewed as a database within which different asset types can be reliably identified. In this presentation, I will review how OU-XML documentsas used in course production and OpenLearn workflows can be mined in order to create course specific search engines (as well as reflecting on why these might NOT be such a good idea after all) and annotatable mindmap styled overviews of module units.
  2. For some years, I have been running a Google Custom Search Engine over resources linked to from the Relevant Knowledge short course, T151 Digital Worlds. The course is built on a resource based learning model, and draws heavily on content available on the public web. The search engine is embedded within an iframe within a page contained within the VLE, and styled in sympathy with the styling of the VLE itself.[Show example of analytics/reporting available from CSE]
  3. For several years, I have been exploring how we might make use of third party, linked to resources within the course context. Trivially, we might view them as of three types: resources linked to by the course team; resources that are discovered and shared within the course context; and resources that are discovered via web searches related to the course.
  4. The notion of a custom course search engine is based on extending a search over a set of course materials to include the resources linked to from the course. Doing this locally would require identifying the course linked resources, crawl them, index them, and make them available to the course search tool. By using a Google custom search engine, we can use Google’s index to provide a search over at least the publicly linked to resources. (Note that the custom search engine will not be able to search over the course material themselves if they are not public. [Google news as searching over content that is behind a paywall in return for a first click view of the content. Could we do the same with education content?]
  5. ?if a GlossaryTerm is labeled as such within the body of an OpenLearn XML document, we should be able to retrieve the contextualisingpargraph and include this as part of the search results. But I don’t do this..