STRUCTURAL TOPIC MODELLING
OF OFSTED DOCUMENTS
BERA seminar, 16 September 2021
Dr Christian Bokhove
Southampton Education School
University of Southampton
Trusting the text: corpus-assisted approaches in education research
Contents
1. Computational Social Science
2. Ofsted’s inspection context and prior
research
3. Structural Topic Modelling
4. Conclusions
Computational research methods
Approach that relies on forms of automated analysis of information,
using computers, to answer education research questions.
The methods can include one or more of the following:
• Analysis depends on algorithms, including the use of
• Artificial intelligence (AI) - computers make complex, human-like judgements
• Machine Learning (ML) - computers learn to copy human behaviour
• Data sets are usually large-scale ('Big Data'); sometimes millions of
sources are collected and analysed.
• Information already exists, rather than being collected specifically for
research.
• 'Scraping' from websites (news, reports, blogs, etc)
• Extraction from databases and archives created for other purposes (e.g. journal
contents, interactions with a learning platform)
• Social networks (e.g. social media)
• Simulating new data
Adapting Cioffi-Revilla (2017), we can distinguish different
types of computational social science, each with associated
computational research methods.
• Automated social information extraction;
• Social networks and social complexity;
• Social simulation modelling;
Here, we focus on the first category.
Cioffi-Revilla, C. (2017). Introduction to computational social science (2nd edition).
London, UK: Springer.
For example, Bokhove (2015) scraped thousands of Ofsted reports from the
inspection website to answer the question of whether topics and sentiments
in the reports had changed over time, using so-called ‘sentiment analysis’.
Bokhove, C. (2015). Text mining school inspection reports in England with R. University of
Southampton.
Bokhove, C., & Sims, S. (2020). Demonstrating the potential of text mining for analyzing school inspection
reports: a sentiment analysis of 17,000 Ofsted documents. International Journal of Research and Method in
Education. https://doi.org/10.1080/1743727X.2020.1819228
Boxplot showing the distribution of sentiment scores by inspection grade. N=3,155.
Average sentiment score for the corpus of inspection documents by Chief Inspector. N=17,212.
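As an illustration of the idea behind a sentiment score per document, averaged by group (e.g. inspection grade), here is a minimal Python sketch. It is not the method used in the cited studies: the tiny word lists, the scoring rule and the function names are all assumptions made for illustration only.

```python
import re
from collections import defaultdict
from statistics import mean

# Hypothetical toy lexicon (assumption); a real study would use a published
# sentiment dictionary rather than these few words.
POSITIVE = {"good", "outstanding", "effective", "strong", "improved"}
NEGATIVE = {"inadequate", "weak", "poor", "failing", "insufficient"}

def sentiment_score(text):
    """Return (positive count - negative count) divided by the number of words."""
    words = re.findall(r"[a-z]+", text.lower())
    if not words:
        return 0.0
    pos = sum(w in POSITIVE for w in words)
    neg = sum(w in NEGATIVE for w in words)
    return (pos - neg) / len(words)

def mean_score_by_group(docs):
    """docs: iterable of (group_label, text) pairs. Returns {group: mean score}."""
    scores = defaultdict(list)
    for label, text in docs:
        scores[label].append(sentiment_score(text))
    return {label: mean(vals) for label, vals in scores.items()}
```

Grouping by grade or by Chief Inspector then only changes which label is passed in with each document.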
R package stm (Roberts, Stewart, & Tingley, 2019)
Analytical approach
• 3155 documents, classified by judgement
• Outstanding
• Good
• Requiring Improvement
• Satisfactory
• Inadequate
• Lower case, stemming, remove stopwords, remove
punctuation, remove numbers
• “Your corpus now has 3155 documents, 1435 terms and
1767508 tokens.”
• Judgement as covariate.
• Age as covariate.
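The cleaning steps listed above (lower case, stemming, stopword, punctuation and number removal) can be sketched in Python as follows. This is a rough illustration, not the stm pipeline used for the slides: the stopword list is a tiny placeholder and crude_stem is a hypothetical stand-in for a proper stemmer.

```python
import re

# Tiny illustrative stopword list (assumption); real pipelines use a fuller list.
STOPWORDS = {"the", "a", "an", "and", "of", "in", "is", "are", "to", "for"}

def crude_stem(word):
    """Very rough suffix stripping; a stand-in for a real stemmer such as Porter's."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word

def preprocess(text):
    """Lower-case, drop punctuation and numbers, remove stopwords, then stem."""
    # Keeping only runs of letters removes punctuation and numbers in one pass.
    tokens = re.findall(r"[a-z]+", text.lower())
    return [crude_stem(t) for t in tokens if t not in STOPWORDS]

def corpus_summary(texts):
    """Return (number of documents, number of distinct terms, number of tokens)."""
    docs = [preprocess(t) for t in texts]
    vocab = {tok for doc in docs for tok in doc}
    n_tokens = sum(len(doc) for doc in docs)
    return len(docs), len(vocab), n_tokens
```

The three numbers returned by corpus_summary correspond to the kind of "documents, terms and tokens" report quoted above, although the actual counts depend on the stemmer and stopword list used.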
Ten topics, some make sense, more prevalent in
‘inadequate’ and ‘requiring improvement’.
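To make "more prevalent in" concrete: once a topic model has assigned each document a set of topic proportions, prevalence can be compared across judgements. The Python sketch below simply averages those proportions per judgement. This is a descriptive simplification, not how a structural topic model estimates covariate effects; the input names theta and labels are assumptions.

```python
from collections import defaultdict

def mean_prevalence_by_group(theta, labels):
    """Average topic proportions per group.

    theta: list of per-document topic proportion lists (each row sums to 1).
    labels: one group label (e.g. inspection judgement) per document.
    """
    if len(theta) != len(labels):
        raise ValueError("theta and labels must have the same length")
    sums = {}
    counts = defaultdict(int)
    for row, label in zip(theta, labels):
        if label not in sums:
            sums[label] = [0.0] * len(row)
        # Accumulate the proportions topic by topic for this group.
        sums[label] = [s + p for s, p in zip(sums[label], row)]
        counts[label] += 1
    return {label: [s / counts[label] for s in total] for label, total in sums.items()}
```

A topic whose mean proportion is clearly higher under one judgement than another is a candidate for closer qualitative reading.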
Some topics hard to gauge.
Other examples…
• Munoz-Najar Galvez et al. (2020) used text analysis to
study the paradigm wars in graduate research in the field of
education.
• Inglis and Foster (2018) used topic modelling, with the
package MALLET, to study evidence of the ‘social turn’ in
five decades of mathematics education research.
Munoz-Najar Galvez, S., Heiberger, R., & McFarland, D. (2020). Paradigm wars
revisited: A cartography of graduate research in the field of education (1980–2010).
American Educational Research Journal, 57(2), 612-652.
Inglis, M., & Foster, C. (2018). Five decades of mathematics education research.
Journal for Research in Mathematics Education, 49(4), 462-500.
Conclusions
• Large corpora of documents can be analysed at scale
with computational methods (e.g. text mining).
• There are several methods to do this, for example
sentiment analysis and (structural) topic modelling.
• Some methods allow for including other variables.
• Real-world documents are messy and probably require
plenty of cleaning. Interpretation can be a challenge.
• Choosing the number of topics is not straightforward. There are
methods for this (e.g. ‘perplexity’).
• Computational methods work well in combination with
qualitative methods e.g. ‘quotes in context’.
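On the point about choosing the number of topics: perplexity is the exponential of the negative average log-probability a fitted model assigns to held-out tokens, so lower is better. The Python sketch below assumes the per-token log-probabilities have already been obtained from models fitted with different numbers of topics; the function names and input format are hypothetical.

```python
import math

def perplexity(token_log_probs):
    """exp of the negative mean natural-log probability per held-out token."""
    if not token_log_probs:
        raise ValueError("need at least one token")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))

def pick_k(log_probs_by_k):
    """log_probs_by_k: {number of topics: held-out token log-probabilities}.

    Returns the candidate number of topics with the lowest perplexity.
    """
    return min(log_probs_by_k, key=lambda k: perplexity(log_probs_by_k[k]))
```

Perplexity is only one criterion; a model that scores well can still produce topics that are hard to interpret, which is where qualitative reading comes in.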
Thank you - Questions
• C.Bokhove@soton.ac.uk
• Southampton Education School
• Twitter: @cbokhove
• Website: www.bokhove.net

More Related Content

What's hot

Public Lecture Hong Kong University, 18 November 2015
Public Lecture Hong Kong University, 18 November 2015Public Lecture Hong Kong University, 18 November 2015
Public Lecture Hong Kong University, 18 November 2015Christian Bokhove
 
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...CITE
 
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"CITE
 
SDS Networking Event breakout session slides - PhD overview
SDS Networking Event breakout session slides - PhD overviewSDS Networking Event breakout session slides - PhD overview
SDS Networking Event breakout session slides - PhD overviewLyndsey Middleton
 
The application of Social Cognitive Theory in Information Science research on...
The application of Social Cognitive Theory in Information Science research on...The application of Social Cognitive Theory in Information Science research on...
The application of Social Cognitive Theory in Information Science research on...Lyndsey Middleton
 
2016-05-31 Venia Legendi (CEITER): Sergey Sosnovsky
2016-05-31 Venia Legendi (CEITER): Sergey Sosnovsky2016-05-31 Venia Legendi (CEITER): Sergey Sosnovsky
2016-05-31 Venia Legendi (CEITER): Sergey Sosnovskyifi8106tlu
 
E-Research Open Learning Conference Unisa 2018
E-Research Open Learning Conference Unisa 2018E-Research Open Learning Conference Unisa 2018
E-Research Open Learning Conference Unisa 2018Terry Anderson
 
2016-05-31 Venia Legendi (CEITER): Adolfo Ruiz Calleja
2016-05-31 Venia Legendi (CEITER): Adolfo Ruiz Calleja2016-05-31 Venia Legendi (CEITER): Adolfo Ruiz Calleja
2016-05-31 Venia Legendi (CEITER): Adolfo Ruiz Callejaifi8106tlu
 
Developing a multiple-document-processing performance assessment for epistem...
 Developing a multiple-document-processing performance assessment for epistem... Developing a multiple-document-processing performance assessment for epistem...
Developing a multiple-document-processing performance assessment for epistem...Simon Knight
 
2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and TrainingNUI Galway
 
Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"CITE
 
Use of ICT for acquiring, practicing and assessing algebraic expertise
 Use of ICT for acquiring, practicing and assessing algebraic expertise  Use of ICT for acquiring, practicing and assessing algebraic expertise
Use of ICT for acquiring, practicing and assessing algebraic expertise Christian Bokhove
 
Social media and scholarly research
Social media and scholarly researchSocial media and scholarly research
Social media and scholarly researchTerry Anderson
 
The Impact of Open Textbooks in the USA and South Africa: When? Why? How?
The Impact of Open Textbooks in the USA and South Africa: When? Why? How?The Impact of Open Textbooks in the USA and South Africa: When? Why? How?
The Impact of Open Textbooks in the USA and South Africa: When? Why? How?OER Hub
 
OLT conference Learning analytics
OLT conference Learning analyticsOLT conference Learning analytics
OLT conference Learning analyticsShirley Alexander
 
Towards Collaborative Learning Analytics
Towards Collaborative Learning AnalyticsTowards Collaborative Learning Analytics
Towards Collaborative Learning Analyticsalywise
 
Introduction to Learning Analytics - Framework and Implementation Concerns
Introduction to Learning Analytics - Framework and Implementation ConcernsIntroduction to Learning Analytics - Framework and Implementation Concerns
Introduction to Learning Analytics - Framework and Implementation ConcernsTore Hoel
 
edmedia2014-learning-analytics-keynote
edmedia2014-learning-analytics-keynoteedmedia2014-learning-analytics-keynote
edmedia2014-learning-analytics-keynoteSimon Buckingham Shum
 

What's hot (20)

Public Lecture Hong Kong University, 18 November 2015
Public Lecture Hong Kong University, 18 November 2015Public Lecture Hong Kong University, 18 November 2015
Public Lecture Hong Kong University, 18 November 2015
 
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
Tiffany Barnes "Making a meaningful difference: Leveraging data to improve le...
 
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
Gobert, Dede, Martin, Rose "Panel: Learning Analytics and Learning Sciences"
 
SDS Networking Event breakout session slides - PhD overview
SDS Networking Event breakout session slides - PhD overviewSDS Networking Event breakout session slides - PhD overview
SDS Networking Event breakout session slides - PhD overview
 
The application of Social Cognitive Theory in Information Science research on...
The application of Social Cognitive Theory in Information Science research on...The application of Social Cognitive Theory in Information Science research on...
The application of Social Cognitive Theory in Information Science research on...
 
2016-05-31 Venia Legendi (CEITER): Sergey Sosnovsky
2016-05-31 Venia Legendi (CEITER): Sergey Sosnovsky2016-05-31 Venia Legendi (CEITER): Sergey Sosnovsky
2016-05-31 Venia Legendi (CEITER): Sergey Sosnovsky
 
E-Research Open Learning Conference Unisa 2018
E-Research Open Learning Conference Unisa 2018E-Research Open Learning Conference Unisa 2018
E-Research Open Learning Conference Unisa 2018
 
engage me
engage meengage me
engage me
 
2016-05-31 Venia Legendi (CEITER): Adolfo Ruiz Calleja
2016-05-31 Venia Legendi (CEITER): Adolfo Ruiz Calleja2016-05-31 Venia Legendi (CEITER): Adolfo Ruiz Calleja
2016-05-31 Venia Legendi (CEITER): Adolfo Ruiz Calleja
 
Developing a multiple-document-processing performance assessment for epistem...
 Developing a multiple-document-processing performance assessment for epistem... Developing a multiple-document-processing performance assessment for epistem...
Developing a multiple-document-processing performance assessment for epistem...
 
QUT Talk
QUT TalkQUT Talk
QUT Talk
 
2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training
 
Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"Xiao Hu "Learning Analytics Initiatives"
Xiao Hu "Learning Analytics Initiatives"
 
Use of ICT for acquiring, practicing and assessing algebraic expertise
 Use of ICT for acquiring, practicing and assessing algebraic expertise  Use of ICT for acquiring, practicing and assessing algebraic expertise
Use of ICT for acquiring, practicing and assessing algebraic expertise
 
Social media and scholarly research
Social media and scholarly researchSocial media and scholarly research
Social media and scholarly research
 
The Impact of Open Textbooks in the USA and South Africa: When? Why? How?
The Impact of Open Textbooks in the USA and South Africa: When? Why? How?The Impact of Open Textbooks in the USA and South Africa: When? Why? How?
The Impact of Open Textbooks in the USA and South Africa: When? Why? How?
 
OLT conference Learning analytics
OLT conference Learning analyticsOLT conference Learning analytics
OLT conference Learning analytics
 
Towards Collaborative Learning Analytics
Towards Collaborative Learning AnalyticsTowards Collaborative Learning Analytics
Towards Collaborative Learning Analytics
 
Introduction to Learning Analytics - Framework and Implementation Concerns
Introduction to Learning Analytics - Framework and Implementation ConcernsIntroduction to Learning Analytics - Framework and Implementation Concerns
Introduction to Learning Analytics - Framework and Implementation Concerns
 
edmedia2014-learning-analytics-keynote
edmedia2014-learning-analytics-keynoteedmedia2014-learning-analytics-keynote
edmedia2014-learning-analytics-keynote
 

Similar to Structural Topic Modelling of Ofsted Documents

Learning Relations from Social Tagging Data
Learning Relations from Social Tagging DataLearning Relations from Social Tagging Data
Learning Relations from Social Tagging DataHang Dong
 
Educational Technology Research Trends: Examining Six SSCI-indexed Refereed ...
Educational Technology Research Trends:  Examining Six SSCI-indexed Refereed ...Educational Technology Research Trends:  Examining Six SSCI-indexed Refereed ...
Educational Technology Research Trends: Examining Six SSCI-indexed Refereed ...Yu-Chang Hsu
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...CILIP MDG
 
2022_01_21 «Teaching Computing in School: Is research reaching classroom prac...
2022_01_21 «Teaching Computing in School: Is research reaching classroom prac...2022_01_21 «Teaching Computing in School: Is research reaching classroom prac...
2022_01_21 «Teaching Computing in School: Is research reaching classroom prac...eMadrid network
 
Webscience Guest Lecture 1-12-2017
Webscience Guest Lecture 1-12-2017Webscience Guest Lecture 1-12-2017
Webscience Guest Lecture 1-12-2017Christian Bokhove
 
Research trends qualitative analysis in cscl
Research trends  qualitative analysis in csclResearch trends  qualitative analysis in cscl
Research trends qualitative analysis in csclMerlien Institute
 
Part 1 research and evaluation edited
Part 1 research and evaluation editedPart 1 research and evaluation edited
Part 1 research and evaluation editedYISMAW MENGGISTU
 
Seminar University of Loughborough: Using technology to support mathematics e...
Seminar University of Loughborough: Using technology to support mathematics e...Seminar University of Loughborough: Using technology to support mathematics e...
Seminar University of Loughborough: Using technology to support mathematics e...Christian Bokhove
 
KV713 Session 3
KV713 Session 3KV713 Session 3
KV713 Session 3kturvey
 
Using phenomenography in educational technology research from 2003 to 2017: A...
Using phenomenography in educational technology research from 2003 to 2017: A...Using phenomenography in educational technology research from 2003 to 2017: A...
Using phenomenography in educational technology research from 2003 to 2017: A...Sally Wan
 
· You may choose one or more chapters from E.G. Whites, The Minist
· You may choose one or more chapters from E.G. Whites, The Minist· You may choose one or more chapters from E.G. Whites, The Minist
· You may choose one or more chapters from E.G. Whites, The MinistLesleyWhitesidefv
 
3 D Project Based Learning Basics for the New Generation Science Standards
3 D Project Based  Learning Basics for the New Generation Science Standards3 D Project Based  Learning Basics for the New Generation Science Standards
3 D Project Based Learning Basics for the New Generation Science Standardsrekharajaseran
 
Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1sasi
 
KV713 Session 3
KV713 Session 3KV713 Session 3
KV713 Session 3kturvey
 
KV713 Session 3
KV713 Session 3 KV713 Session 3
KV713 Session 3 kturvey
 
IT 3010 Lecture 1 Introduction
IT 3010 Lecture 1 IntroductionIT 3010 Lecture 1 Introduction
IT 3010 Lecture 1 IntroductionBabakFarshchian
 

Similar to Structural Topic Modelling of Ofsted Documents (20)

Learning Relations from Social Tagging Data
Learning Relations from Social Tagging DataLearning Relations from Social Tagging Data
Learning Relations from Social Tagging Data
 
Educational Technology Research Trends: Examining Six SSCI-indexed Refereed ...
Educational Technology Research Trends:  Examining Six SSCI-indexed Refereed ...Educational Technology Research Trends:  Examining Six SSCI-indexed Refereed ...
Educational Technology Research Trends: Examining Six SSCI-indexed Refereed ...
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
 
2022_01_21 «Teaching Computing in School: Is research reaching classroom prac...
2022_01_21 «Teaching Computing in School: Is research reaching classroom prac...2022_01_21 «Teaching Computing in School: Is research reaching classroom prac...
2022_01_21 «Teaching Computing in School: Is research reaching classroom prac...
 
Webscience Guest Lecture 1-12-2017
Webscience Guest Lecture 1-12-2017Webscience Guest Lecture 1-12-2017
Webscience Guest Lecture 1-12-2017
 
Research trends qualitative analysis in cscl
Research trends  qualitative analysis in csclResearch trends  qualitative analysis in cscl
Research trends qualitative analysis in cscl
 
Part 1 research and evaluation edited
Part 1 research and evaluation editedPart 1 research and evaluation edited
Part 1 research and evaluation edited
 
Data informed decision making - Yaz El Hakim
Data informed decision making - Yaz El HakimData informed decision making - Yaz El Hakim
Data informed decision making - Yaz El Hakim
 
Seminar University of Loughborough: Using technology to support mathematics e...
Seminar University of Loughborough: Using technology to support mathematics e...Seminar University of Loughborough: Using technology to support mathematics e...
Seminar University of Loughborough: Using technology to support mathematics e...
 
KV713 Session 3
KV713 Session 3KV713 Session 3
KV713 Session 3
 
Using phenomenography in educational technology research from 2003 to 2017: A...
Using phenomenography in educational technology research from 2003 to 2017: A...Using phenomenography in educational technology research from 2003 to 2017: A...
Using phenomenography in educational technology research from 2003 to 2017: A...
 
· You may choose one or more chapters from E.G. Whites, The Minist
· You may choose one or more chapters from E.G. Whites, The Minist· You may choose one or more chapters from E.G. Whites, The Minist
· You may choose one or more chapters from E.G. Whites, The Minist
 
3 D Project Based Learning Basics for the New Generation Science Standards
3 D Project Based  Learning Basics for the New Generation Science Standards3 D Project Based  Learning Basics for the New Generation Science Standards
3 D Project Based Learning Basics for the New Generation Science Standards
 
Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1Fundamentals of Data science Introduction Unit 1
Fundamentals of Data science Introduction Unit 1
 
KV713 Session 3
KV713 Session 3KV713 Session 3
KV713 Session 3
 
KV713 Session 3
KV713 Session 3 KV713 Session 3
KV713 Session 3
 
IT 3010 Lecture 1 Introduction
IT 3010 Lecture 1 IntroductionIT 3010 Lecture 1 Introduction
IT 3010 Lecture 1 Introduction
 
TDT39 oppstartsmøte
TDT39 oppstartsmøteTDT39 oppstartsmøte
TDT39 oppstartsmøte
 
Cadgme2016 keynote final
Cadgme2016 keynote finalCadgme2016 keynote final
Cadgme2016 keynote final
 
Websci 2018
Websci 2018Websci 2018
Websci 2018
 

More from Christian Bokhove

Can data from largescale assessments ever be useful For mathematics education?
Can data from largescale assessments ever be useful For mathematics education?Can data from largescale assessments ever be useful For mathematics education?
Can data from largescale assessments ever be useful For mathematics education?Christian Bokhove
 
Creating interactive digital books for the transition from secondary to under...
Creating interactive digital books for the transition from secondary to under...Creating interactive digital books for the transition from secondary to under...
Creating interactive digital books for the transition from secondary to under...Christian Bokhove
 
Research on school inspections: What do we know?
Research on school inspections: What do we know?Research on school inspections: What do we know?
Research on school inspections: What do we know?Christian Bokhove
 
Master mathematics teachers: What do Chinese primary schools look like?
Master mathematics teachers: What do Chinese primary schools look like?Master mathematics teachers: What do Chinese primary schools look like?
Master mathematics teachers: What do Chinese primary schools look like?Christian Bokhove
 
The role of non-cognitive factors in science achievement: an analysis of PISA...
The role of non-cognitive factors in science achievement: an analysis of PISA...The role of non-cognitive factors in science achievement: an analysis of PISA...
The role of non-cognitive factors in science achievement: an analysis of PISA...Christian Bokhove
 
Multilevel modelling of Chinese primary children’s metacognitive strategies i...
Multilevel modelling of Chinese primary children’s metacognitive strategies i...Multilevel modelling of Chinese primary children’s metacognitive strategies i...
Multilevel modelling of Chinese primary children’s metacognitive strategies i...Christian Bokhove
 
Help-seeking in an online maths environment: A sequence analysis of log files
Help-seeking in an online maths environment: A sequence analysis of log filesHelp-seeking in an online maths environment: A sequence analysis of log files
Help-seeking in an online maths environment: A sequence analysis of log filesChristian Bokhove
 
Learning loss and learning inequalities during the covid-19 pandemic: an anal...
Learning loss and learning inequalities during the covid-19 pandemic: an anal...Learning loss and learning inequalities during the covid-19 pandemic: an anal...
Learning loss and learning inequalities during the covid-19 pandemic: an anal...Christian Bokhove
 
The challenge of proof in the transition from A-level mathematics to university
The challenge of proof in the transition from A-level mathematics to universityThe challenge of proof in the transition from A-level mathematics to university
The challenge of proof in the transition from A-level mathematics to universityChristian Bokhove
 
How can we develop expansive, research-informed ITE ?
How can we develop expansive, research-informed ITE ?How can we develop expansive, research-informed ITE ?
How can we develop expansive, research-informed ITE ?Christian Bokhove
 
(On)waarheden en (on)bekende zaken uit onderzoek over reken-wiskundeonderwijs
(On)waarheden en (on)bekende zaken uit onderzoek over reken-wiskundeonderwijs(On)waarheden en (on)bekende zaken uit onderzoek over reken-wiskundeonderwijs
(On)waarheden en (on)bekende zaken uit onderzoek over reken-wiskundeonderwijsChristian Bokhove
 
Transparency in Data Analysis
Transparency in Data AnalysisTransparency in Data Analysis
Transparency in Data AnalysisChristian Bokhove
 
Proof by induction in Calculus: Investigating first-year students’ examinatio...
Proof by induction in Calculus: Investigating first-year students’ examinatio...Proof by induction in Calculus: Investigating first-year students’ examinatio...
Proof by induction in Calculus: Investigating first-year students’ examinatio...Christian Bokhove
 
Evidence informed: Waar is de Bijsluiter?
Evidence informed: Waar is de Bijsluiter?Evidence informed: Waar is de Bijsluiter?
Evidence informed: Waar is de Bijsluiter?Christian Bokhove
 
Roundtable slides RiTE Paderborn 24/9/2021
Roundtable slides RiTE Paderborn 24/9/2021Roundtable slides RiTE Paderborn 24/9/2021
Roundtable slides RiTE Paderborn 24/9/2021Christian Bokhove
 
Learning loss and learning inequalities during the Covid-19 pandemic: an anal...
Learning loss and learning inequalities during the Covid-19 pandemic: an anal...Learning loss and learning inequalities during the Covid-19 pandemic: an anal...
Learning loss and learning inequalities during the Covid-19 pandemic: an anal...Christian Bokhove
 
How can we engage mathematics ITE students with research?
How can we engage mathematics ITE students with research?How can we engage mathematics ITE students with research?
How can we engage mathematics ITE students with research?Christian Bokhove
 

More from Christian Bokhove (20)

Can data from largescale assessments ever be useful For mathematics education?
Can data from largescale assessments ever be useful For mathematics education?Can data from largescale assessments ever be useful For mathematics education?
Can data from largescale assessments ever be useful For mathematics education?
 
Creating interactive digital books for the transition from secondary to under...
Creating interactive digital books for the transition from secondary to under...Creating interactive digital books for the transition from secondary to under...
Creating interactive digital books for the transition from secondary to under...
 
Research on school inspections: What do we know?
Research on school inspections: What do we know?Research on school inspections: What do we know?
Research on school inspections: What do we know?
 
Master mathematics teachers: What do Chinese primary schools look like?
Master mathematics teachers: What do Chinese primary schools look like?Master mathematics teachers: What do Chinese primary schools look like?
Master mathematics teachers: What do Chinese primary schools look like?
 
The role of non-cognitive factors in science achievement: an analysis of PISA...
The role of non-cognitive factors in science achievement: an analysis of PISA...The role of non-cognitive factors in science achievement: an analysis of PISA...
The role of non-cognitive factors in science achievement: an analysis of PISA...
 
Multilevel modelling of Chinese primary children’s metacognitive strategies i...
Multilevel modelling of Chinese primary children’s metacognitive strategies i...Multilevel modelling of Chinese primary children’s metacognitive strategies i...
Multilevel modelling of Chinese primary children’s metacognitive strategies i...
 
Cryptography
CryptographyCryptography
Cryptography
 
Help-seeking in an online maths environment: A sequence analysis of log files
Help-seeking in an online maths environment: A sequence analysis of log filesHelp-seeking in an online maths environment: A sequence analysis of log files
Help-seeking in an online maths environment: A sequence analysis of log files
 
Learning loss and learning inequalities during the covid-19 pandemic: an anal...
Learning loss and learning inequalities during the covid-19 pandemic: an anal...Learning loss and learning inequalities during the covid-19 pandemic: an anal...
Learning loss and learning inequalities during the covid-19 pandemic: an anal...
 
The challenge of proof in the transition from A-level mathematics to university
The challenge of proof in the transition from A-level mathematics to universityThe challenge of proof in the transition from A-level mathematics to university
The challenge of proof in the transition from A-level mathematics to university
 
How can we develop expansive, research-informed ITE ?
How can we develop expansive, research-informed ITE ?How can we develop expansive, research-informed ITE ?
How can we develop expansive, research-informed ITE ?
 
Discussant EARLI sig 27
Discussant EARLI sig 27Discussant EARLI sig 27
Discussant EARLI sig 27
 
(On)waarheden en (on)bekende zaken uit onderzoek over reken-wiskundeonderwijs
(On)waarheden en (on)bekende zaken uit onderzoek over reken-wiskundeonderwijs(On)waarheden en (on)bekende zaken uit onderzoek over reken-wiskundeonderwijs
(On)waarheden en (on)bekende zaken uit onderzoek over reken-wiskundeonderwijs
 
Transparency in Data Analysis
Transparency in Data AnalysisTransparency in Data Analysis
Transparency in Data Analysis
 
Proof by induction in Calculus: Investigating first-year students’ examinatio...
Proof by induction in Calculus: Investigating first-year students’ examinatio...Proof by induction in Calculus: Investigating first-year students’ examinatio...
Proof by induction in Calculus: Investigating first-year students’ examinatio...
 
Evidence informed: Waar is de Bijsluiter?
Evidence informed: Waar is de Bijsluiter?Evidence informed: Waar is de Bijsluiter?
Evidence informed: Waar is de Bijsluiter?
 
Roundtable slides RiTE Paderborn 24/9/2021
Roundtable slides RiTE Paderborn 24/9/2021Roundtable slides RiTE Paderborn 24/9/2021
Roundtable slides RiTE Paderborn 24/9/2021
 
Learning loss and learning inequalities during the Covid-19 pandemic: an anal...
Learning loss and learning inequalities during the Covid-19 pandemic: an anal...Learning loss and learning inequalities during the Covid-19 pandemic: an anal...
Learning loss and learning inequalities during the Covid-19 pandemic: an anal...
 
How can we engage mathematics ITE students with research?
How can we engage mathematics ITE students with research?How can we engage mathematics ITE students with research?
How can we engage mathematics ITE students with research?
 
AMET-NAMA
AMET-NAMAAMET-NAMA
AMET-NAMA
 

Structural Topic Modelling of Ofsted Documents

  • 1. STRUCTURAL TOPIC MODELLING OF OFSTED DOCUMENTS BERA seminar, 16 September 2021 Dr Christian Bokhove Southampton Education School University of Southampton Trusting the text: corpus-assisted approaches in education research
  • 2. Contents 1. Computational Social Science 2. Ofsted’s inspection context and prior research 3. Structural Topic Modelling 4. Conclusions
  • 3. Computational research methods Approach that relies on forms of automated analysis of information, using computers, to answer education research questions. The methods can include one or more of the following: • Analysis depends on algorithms, including the use of • Artificial intelligence (AI) - computers make complex, human-like judgements • Machine Learning (ML) - computers learn to copy human behaviour • Data sets are usually large scale, 'Big Data', sometimes millions of sources are collected and analysed. • Information already exists, rather than collected specifically for research. • 'Scraping' from websites (news, reports, blogs, etc) • Extraction from databases and archives created for other purposes (eg journal contents, interactions with a learning platform) • Social networks (e.g. social media) • Simulating new data
  • 4. Adapting Cioffi-Revilla (2017), we can distinguish different types of computational social science, each with associated computational research methods. • Automated social information extraction; • Social networks and social complexity; • Social simulation modelling; Here, we focus on the first category. Cioffi-Revilla, C. (2017). Introduction to computational social science (2nd edition). London, UK: Springer.
  • 6. For example, Bokhove (2015) scraped thousands of Ofsted reports from the inspection website to examine whether topics and sentiments in the reports had changed over time, so-called ‘sentiment analysis’. Bokhove, C. (2015). Text mining school inspection reports in England with R. University of Southampton.
  • 8. Bokhove, C., & Sims, S. (2020). Demonstrating the potential of text mining for analyzing school inspection reports: a sentiment analysis of 17,000 Ofsted documents. International Journal of Research and Method in Education. https://doi.org/10.1080/1743727X.2020.1819228
  • 10. Boxplot showing the distribution of sentiment scores by inspection grade. N=3,155.
  • 11. Average sentiment score for the corpus of inspection documents by Chief Inspector. N=17,212.
  • 13. Analytical approach • 3155 documents, classified by judgement • Outstanding • Good • Requiring Improvement • Satisfactory • Inadequate • Lower case, stemming, remove stopwords, remove punctuation, remove numbers • “Your corpus now has 3155 documents, 1435 terms and 1767508 tokens.” • Judgement as covariate. • Age as covariate.
  • 15. Ten topics, some make sense, more prevalent in ‘inadequate’ and ‘requiring improvement’. Some topics hard to gauge:
  • 17. Other examples… • Munoz-Najar Galvez et al. (2020) used text analysis to study the paradigm wars in graduate research in the field of education. • Inglis and Foster (2018) used topic modelling with the package MALLET to study evidence of the ‘social turn’ in five decades of mathematics education research. Munoz-Najar Galvez, S., Heiberger, R., & McFarland, D. (2020). Paradigm wars revisited: A cartography of graduate research in the field of education (1980–2010). American Educational Research Journal, 57(2), 612-652. Inglis, M., & Foster, C. (2018). Five decades of mathematics education research. Journal for Research in Mathematics Education, 49(4), 462-500.
  • 18. Conclusions • Large corpora of documents can be analysed at scale with computational methods (e.g. text mining). • There are several methods to do this, for example sentiment analysis and (structural) topic modelling. • Some methods allow for including other variables. • Real-world documents are messy and probably require plenty of cleaning. Interpretation can be a challenge. • Choosing the number of topics is not straightforward. There are methods for this (e.g. ‘perplexity’). • Computational methods work well in combination with qualitative methods, e.g. ‘quotes in context’.
  • 19. Thank you - Questions • C.Bokhove@soton.ac.uk • Southampton Education School • Twitter: @cbokhove • Website: www.bokhove.net
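
The cleaning steps listed on slide 13 (lower case, stemming, remove stopwords, remove punctuation, remove numbers) can be illustrated with a minimal Python sketch. This is not the pipeline used for the talk. The stopword list, the `crude_stem` suffix stripper (a stand-in for a proper stemmer such as Porter's) and the function names are all assumptions made for illustration.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; real lists are much longer).
STOPWORDS = {"the", "a", "an", "and", "of", "in", "is", "are", "to", "has", "have"}


def crude_stem(word):
    """Strip a few common English suffixes.

    Hypothetical stand-in for a real stemming algorithm; it only
    shows where stemming sits in the pipeline.
    """
    for suffix in ("ing", "ed", "es", "s"):
        # Keep at least three characters of the stem.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text):
    """Lower-case, drop numbers and punctuation, drop stopwords, then stem."""
    text = text.lower()
    text = re.sub(r"[0-9]+", " ", text)    # remove numbers
    text = re.sub(r"[^\w\s]", " ", text)   # remove punctuation
    tokens = [t for t in text.split() if t not in STOPWORDS]
    return [crude_stem(t) for t in tokens]


def corpus_summary(docs):
    """Return (number of documents, number of distinct terms, number of tokens)."""
    processed = [preprocess(d) for d in docs]
    counts = Counter(tok for doc in processed for tok in doc)
    return len(processed), len(counts), sum(counts.values())
```

Calling `corpus_summary` on a list of document strings gives the same three kinds of counts (documents, terms, tokens) as the corpus message quoted on slide 13, although the actual numbers depend on the stemmer and stopword list used.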

Editor's Notes

  1. In England, the national school inspectorate Ofsted plays an important role in judging educational quality. It periodically inspects schools and judges them. The result of each inspection is captured in inspection reports and associated documents. Ofsted has had several chief inspectors (HMCI) since 2000, and every HMCI tends to put his or her own mark on the inspectorate. This paper extends the analysis in Author (2020), using the corpus of more than 17,000 Ofsted documents that were scraped from the Ofsted website with text-mining techniques. Using the computational research method of structural topic modelling, I re-analyse a set of documents that typically could not be analysed with manual methods. I juxtapose the findings with previous findings from sentiment analyses. The paper not only covers the substantive topic at hand but also provides insight into how the methods work and how they reveal policy shifts during the ‘reign’ of different HMCIs. All in all, we can see how such text-mining techniques allow us to analyse existing documents at scale.
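
The conclusions slide names ‘perplexity’ as one way to inform the choice of the number of topics. As a rough sketch: perplexity is the exponential of the average negative log-likelihood per held-out token, so lower values mean the model predicts unseen text better. The function below is an illustration only. It assumes the fitted model has already been reduced to a dictionary of per-word probabilities (`word_probs`, a hypothetical name), whereas a real topic model would derive these from its topic and document distributions.

```python
import math


def perplexity(held_out_tokens, word_probs, floor=1e-12):
    """Perplexity of held-out tokens under a word-probability model.

    word_probs maps each word to its predicted probability (assumed input).
    Unseen words are floored at a small value to avoid log(0); the floor
    is an illustrative choice, not a recommended smoothing method.
    """
    if not held_out_tokens:
        raise ValueError("need at least one held-out token")
    log_likelihood = sum(
        math.log(max(word_probs.get(tok, 0.0), floor)) for tok in held_out_tokens
    )
    return math.exp(-log_likelihood / len(held_out_tokens))
```

In practice one would fit models with different numbers of topics, compute perplexity on the same held-out documents for each, and compare the values alongside a qualitative reading of the topics.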