SlideShare a Scribd company logo
1 of 53
wikipedia as a data set
& analytical device
Erik Borra, June 2013
DMI research with Wikipedia
reference work
bureaucracy
scandal machine
repurposing Wikipedia as a ...
vigilant community
Medium-specific outlook
“follow the medium”
(Rogers 2009)
The Anatomy of a Wikipedia page
DMI research with Wikipedia
reference work
bureaucracy
scandal machine
repurposing Wikipedia as a ...
vigilant community
DMI research with Wikipedia
repurposing Wikipedia as a ...
reference work <=> cultural reference
bureaucracy
scandal machine
vigilant community
www.nature.com/nature/journal/
v438/n7070/extref/438900a-s1.doc
Burial of 465 identified Bosniaks,
Potočari, 2007.
Map of the Srebrenica military
operations, made by the U.S. Central
Intelligence Agency, with green
arrow showing the route of the
Bosnian forces.
Map of the location of Srebrenica,
the Republika Srpska,
Bosnia-Herzegovina.
Srebrenica-Potočari Memorial and
Cemetery, Bosnia-Herzegovina.
Grave of a 13-year old Bosniak boy.
Ratko Mladic.
An exhumed body with blindfold
and hands tied behind his back. As of
September 2012, the photo has been
removed from Wikipedia article.
Exhumed grave of victims, 2007.
Podrinje Identification Project's
facility for storing, processing, and
handling exhumed remains..
"UN left 8,000 to die in Bosnia."
Headline in The Independent,
30 October 1995.
Satellite photo of Nova Kasaba
mass grave.
International Criminal Tribunal for
the Former Yugoslavia, Den Haag,
the Netherlands.
DUTCH ENGLISH BOSNIAN CROATIAN SERBIAN
SERBO-
CROATIAN
Tool: Wikipedia Cross-Lingual
Image Analysis
Tool: triangulate
National Point of View
Neutral Point of View
Linguistic Point of View
manypedia.com - comparing linguistic points of view (LPOV)
omnipedia.northwestern.edu - making Wikipedia articles in
different languages comparable
DMI research with Wikipedia
repurposing Wikipedia as a ...
reference work
bureaucracy
scandal machine
vigilant community <=> socio-technical device
Wikipedia has been described in terms of open source intelligence(Stalder
and Hirsh, 2002), wisdom of crowds(Surowiecki, 2004; Kittur and Kraut, 2008),
many minds(Sunstein, 2006), collaborative knowledge(Poe, 2006,
McKenzie Wark 2007), an army of volunteers(Jenkins, 2006), mass
collaboration(Tapscott, 2007), distributed collaboration(Shirky,
2008), produsage(Bruns, 2008), crowdsourcing(Economist, 2008), and
mentioned in the context of free labour(Deuze, 2006), and the cult of the
amateur(Keen, 2007).
Tool: Wikipedia Edits Scraper and IP Localizer
+ screen recording software
Govcom.org, 2008.
http://govcom.org/
Analysis by Zachary Devereaux, Sabine Niederer, Richard Rogers, Bram Nijhof
and Michael Stevenson.
© 2008
Wikipedia editing by bots and users: Overall percentage of Bot
activity of all edit activity________________________________
08August
Wikipedia editing by bots and users: Overall percentage of Bot activity of all edit activity
Visualization: Rosa Menkman.
Percentage of bot edits
All wikipedia edits
Digital Methods Initiative, 2008.
www.digitalmethods.net
Analysis by Zachary Devereaux, Sabine Niederer, Richard Rogers, Bram Nijhof
and Michael Stevenson.
© 2008
Human, Bot and Software Assisted Activity on Top Twenty EN
Wikipedia Articles_________________________________________
0August
Human, Bot and Software Assisted Activity on Top Twenty EN Wikipedia Articles
46444644
Software Assisted Edits
30673067
Bot Edits
251378251378
Human Edits
Maps by: Rosa Menkman, Auke Touwslager and Marieke van Dijk
Govcomorg Jubilee, 2008.
www.govcom.org
Analysis by Zachary Devereaux, Sabine Niederer, Richard Rogers, Bram Nijhof
and Michael Stevenson
Visualization by Auke Touwslager
© 2008
Human, Bot and Software Assisted Activity on Top Twenty EN
Wikipedia Articles_________________________________________
0August
Percentage of bot activity
Language scaled by Wikipedia size
Cornish Võro Manx Ladino Friulian Romansh Aromanian Sanskrit Upper Sorbian Corsican Scottish Gaelic Samogitian Chuvash Aragonese Occitan Breton
Endangered Languages
Revived Languages
Hawaiian Cornish Manx West Frisian Belarusian Basque Galician Estonian Hebrew Czech Catalan
Wikipedia Bot Activity in Endangered and Revived Languages
What is the level of bot activity per language, looking at endangered and revived languages?
stats.wikimedia.org + bubble lines
Govcomorg Jubilee, 2008.
www.govcom.org
Analysis by Zachary Devereaux, Sabine Niederer, Richard Rogers, Bram Nijhof
and Michael Stevenson
Visualization by Auke Touwslager
© 2008
Human, Bot and Software Assisted Activity on Top Twenty EN
Wikipedia Articles_________________________________________
0August
Javanese
Urdu
Tamil
Hindi
Marathi Bengali
Telugu
Korean
Vietnamese
Arabic
Chinese
Russian
Spanish
Japanese
Italian
Portuguese
German
French
English
Percentage of bot activity
Language scaled by Wikipedia size
Wikipedia Bot Activity in Most-Used Languages Worldwide
What is the level of bot activity per language, looking at top 20 of most-used languages?
stats.wikimedia.org + Dorling
DMI research with Wikipedia
repurposing Wikipedia as a ...
reference work
bureaucracy
scandal machine <=> place of edits
vigilant community
DMI research with Wikipedia
repurposing Wikipedia as a ...
reference work
bureaucracy <=> controversy diagnostics machine
scandal machine
vigilant community
Administrative apparatus:
Wikipedia’s procedural policies, guidelines and essays
... with the purpose of reaching consensus
Core content policies
Wikipedia collaboration and conflict studies:
Cooperation and conflict (Viegas, 2004)
Conflict and coordination (Kittur, 2007)
Revert Graph (Suh et al, 2007)
Mutual reverts (Brandes, 2007)
Argument identification (Rad & Barosa, 2011)
Edit wars (Sumi, 2011)
Evolution of discussions (Kaltenbrunner & Laniado, 2012)
medium-specific approach:
repurpose how consensus principles (as method
of the medium) act on the objects of the medium
templates
links
images
references
interlanguage links
timestamps comments
Wikipedia objects
author
contropedia.net
DMI research with Wikipedia
reference work <=> cultural reference
bureaucracy <=> controversy diagnostics machine
scandal machine <=> places of edits
repurposing Wikipedia as a ...
vigilant community <=> socio-technical system
DMI projects mentioned
R. Rogers, Digital Methods , Cambridge, MA: MIT Press, 2013. Chapter 5.
https://wiki.digitalmethods.net/Dmi/DebottingWikipedia
https://www.digitalmethods.net/Digitalmethods/TheNetworkedContent
Contropedia, contropedia.net
S. Niederer and J. van Dijck, " Wisdom of the Crowd or Technicity of
Content? Wikipedia as Socio-technical System," New Media & Society,
12, 8, 2010, 1368-1387.
tools.digitalmethods.net
Tools native to Wikipedia
Page information
Search revision history
Contributors per article, ordered by number of edits
Page view statistics
stats.wikimedia.org - Data is published in exportable,
computable format, eg csv. Stats look at all wikis hosted
by the foundation, largest are the Wikipedias.
User edits searches, for all the edits of a specific user
Number of watchers
Other useful tools
manypedia.com - puts two language versions of an
article side by side / computes concept similarity
omnipedia.northwestern.edu - making Wikipedia
articles in different languages comparable
wikiscanner (defunct)
history flow (defunct)
http://vs.aka-online.de/cgi-bin/wppagehiststat.pl
builds an edit history overview page for the article
http://sonetlab.fbk.eu/wikitrip to see geographical
statistics for anonymous edits and gender
Academic literature on Wikipedia
http://en.wikipedia.org/wiki/Wikipedia:Academic_studies_of_Wikipedia
Okoli et al. (2012). The people's encyclopedia under the gaze of
the sages: A systematic review of scholarly research on Wikipedia
www.digitalmethods.net
sabine@digitalmethods.net
thank you.
erik@digitalmethods.net

More Related Content

Viewers also liked

Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Digital Methods Initiative
 
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Post-social methods? Issues in live research, by Noortje Marres and Esther We...Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Post-social methods? Issues in live research, by Noortje Marres and Esther We...Digital Methods Initiative
 
Tracking the Trackers tutorial at the Digital Methods Summer School 2013
Tracking the Trackers tutorial at the Digital Methods Summer School 2013Tracking the Trackers tutorial at the Digital Methods Summer School 2013
Tracking the Trackers tutorial at the Digital Methods Summer School 2013Digital Methods Initiative
 
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_Digital Methods Initiative
 
Digital Methods Summer School 2015 Tool Medley
Digital Methods Summer School 2015 Tool MedleyDigital Methods Summer School 2015 Tool Medley
Digital Methods Summer School 2015 Tool MedleyDigital Methods Initiative
 
Interactive visualization and exploration of network data with Gephi
Interactive visualization and exploration of network data with GephiInteractive visualization and exploration of network data with Gephi
Interactive visualization and exploration of network data with GephiDigital Methods Initiative
 
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...Digital Methods Initiative
 
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...Digital Methods Initiative
 

Viewers also liked (12)

Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
 
Rogers data days_2014_slides_opti
Rogers data days_2014_slides_optiRogers data days_2014_slides_opti
Rogers data days_2014_slides_opti
 
Web Flags Summer School 2012
Web Flags Summer School 2012Web Flags Summer School 2012
Web Flags Summer School 2012
 
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Post-social methods? Issues in live research, by Noortje Marres and Esther We...Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
 
Tracking the Trackers tutorial at the Digital Methods Summer School 2013
Tracking the Trackers tutorial at the Digital Methods Summer School 2013Tracking the Trackers tutorial at the Digital Methods Summer School 2013
Tracking the Trackers tutorial at the Digital Methods Summer School 2013
 
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
 
Digital Methods Summer School 2015 Tool Medley
Digital Methods Summer School 2015 Tool MedleyDigital Methods Summer School 2015 Tool Medley
Digital Methods Summer School 2015 Tool Medley
 
Interactive visualization and exploration of network data with Gephi
Interactive visualization and exploration of network data with GephiInteractive visualization and exploration of network data with Gephi
Interactive visualization and exploration of network data with Gephi
 
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
 
Digital Methods Tool Medley
Digital Methods Tool MedleyDigital Methods Tool Medley
Digital Methods Tool Medley
 
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
 
The Birth of Social Media Methods
The Birth of Social Media MethodsThe Birth of Social Media Methods
The Birth of Social Media Methods
 

Similar to Repurposing Wikipedia: Wikipedia as data set and analytical device

Improving the Coverage of Complex Issues with Data Journalism and Digital Met...
Improving the Coverage of Complex Issues with Data Journalism and Digital Met...Improving the Coverage of Complex Issues with Data Journalism and Digital Met...
Improving the Coverage of Complex Issues with Data Journalism and Digital Met...Liliana Bounegru
 
Sdi, communities and social media
Sdi, communities and social mediaSdi, communities and social media
Sdi, communities and social mediaWirelessInfo
 
Hypermedia-driven Socio-technical Networks for Goal-driven Discovery in the W...
Hypermedia-driven Socio-technical Networks for Goal-driven Discovery in the W...Hypermedia-driven Socio-technical Networks for Goal-driven Discovery in the W...
Hypermedia-driven Socio-technical Networks for Goal-driven Discovery in the W...Andrei Ciortea
 
Scholarship in the Digital World
Scholarship in the Digital WorldScholarship in the Digital World
Scholarship in the Digital WorldDavid De Roure
 
Big Data meets Big Social: Social Machines and the Semantic Web
Big Data meets Big Social: Social Machines and the Semantic WebBig Data meets Big Social: Social Machines and the Semantic Web
Big Data meets Big Social: Social Machines and the Semantic WebDavid De Roure
 
e-Research: A Social Informatics Perspective
e-Research: A Social Informatics Perspectivee-Research: A Social Informatics Perspective
e-Research: A Social Informatics PerspectiveEric Meyer
 
KNVI-IP inspiratiemiddag over Wikipedia - presentatie Richard Rogers
KNVI-IP inspiratiemiddag over Wikipedia - presentatie Richard RogersKNVI-IP inspiratiemiddag over Wikipedia - presentatie Richard Rogers
KNVI-IP inspiratiemiddag over Wikipedia - presentatie Richard Rogersmarjobakker
 
SNSInkCloudWiner20150410
SNSInkCloudWiner20150410SNSInkCloudWiner20150410
SNSInkCloudWiner20150410Dov Winer
 
What Actor-Network Theory (ANT) and digital methods can do for data journalis...
What Actor-Network Theory (ANT) and digital methods can do for data journalis...What Actor-Network Theory (ANT) and digital methods can do for data journalis...
What Actor-Network Theory (ANT) and digital methods can do for data journalis...Liliana Bounegru
 
FirstWorkshopOnWikipediaResearch
FirstWorkshopOnWikipediaResearchFirstWorkshopOnWikipediaResearch
FirstWorkshopOnWikipediaResearchwebuploader
 
Digital Humanities in a Linked Data World - Semnantic Annotations
Digital Humanities in a Linked Data World - Semnantic AnnotationsDigital Humanities in a Linked Data World - Semnantic Annotations
Digital Humanities in a Linked Data World - Semnantic AnnotationsDov Winer
 
Quora ML Workshop: Engineering at the Intersection of Productive Efficiency, ...
Quora ML Workshop: Engineering at the Intersection of Productive Efficiency, ...Quora ML Workshop: Engineering at the Intersection of Productive Efficiency, ...
Quora ML Workshop: Engineering at the Intersection of Productive Efficiency, ...Quora
 
Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)Takashi Iba
 
Web Observatories and e-Research
Web Observatories and e-ResearchWeb Observatories and e-Research
Web Observatories and e-ResearchDavid De Roure
 
humaniki User Research Report
humaniki User Research Report humaniki User Research Report
humaniki User Research Report Sejal Khatri
 
BioWikis BSB10
BioWikis BSB10BioWikis BSB10
BioWikis BSB10Dan Bolser
 

Similar to Repurposing Wikipedia: Wikipedia as data set and analytical device (20)

Improving the Coverage of Complex Issues with Data Journalism and Digital Met...
Improving the Coverage of Complex Issues with Data Journalism and Digital Met...Improving the Coverage of Complex Issues with Data Journalism and Digital Met...
Improving the Coverage of Complex Issues with Data Journalism and Digital Met...
 
Sdi, communities and social media
Sdi, communities and social mediaSdi, communities and social media
Sdi, communities and social media
 
Rogers digitalmethods 4nov2010
Rogers digitalmethods 4nov2010Rogers digitalmethods 4nov2010
Rogers digitalmethods 4nov2010
 
Hypermedia-driven Socio-technical Networks for Goal-driven Discovery in the W...
Hypermedia-driven Socio-technical Networks for Goal-driven Discovery in the W...Hypermedia-driven Socio-technical Networks for Goal-driven Discovery in the W...
Hypermedia-driven Socio-technical Networks for Goal-driven Discovery in the W...
 
Scholarship in the Digital World
Scholarship in the Digital WorldScholarship in the Digital World
Scholarship in the Digital World
 
Big Data meets Big Social: Social Machines and the Semantic Web
Big Data meets Big Social: Social Machines and the Semantic WebBig Data meets Big Social: Social Machines and the Semantic Web
Big Data meets Big Social: Social Machines and the Semantic Web
 
e-Research: A Social Informatics Perspective
e-Research: A Social Informatics Perspectivee-Research: A Social Informatics Perspective
e-Research: A Social Informatics Perspective
 
KNVI-IP inspiratiemiddag over Wikipedia - presentatie Richard Rogers
KNVI-IP inspiratiemiddag over Wikipedia - presentatie Richard RogersKNVI-IP inspiratiemiddag over Wikipedia - presentatie Richard Rogers
KNVI-IP inspiratiemiddag over Wikipedia - presentatie Richard Rogers
 
SNSInkCloudWiner20150410
SNSInkCloudWiner20150410SNSInkCloudWiner20150410
SNSInkCloudWiner20150410
 
What Actor-Network Theory (ANT) and digital methods can do for data journalis...
What Actor-Network Theory (ANT) and digital methods can do for data journalis...What Actor-Network Theory (ANT) and digital methods can do for data journalis...
What Actor-Network Theory (ANT) and digital methods can do for data journalis...
 
FirstWorkshopOnWikipediaResearch
FirstWorkshopOnWikipediaResearchFirstWorkshopOnWikipediaResearch
FirstWorkshopOnWikipediaResearch
 
Dh usp 2013
Dh usp 2013Dh usp 2013
Dh usp 2013
 
Digital Humanities in a Linked Data World - Semnantic Annotations
Digital Humanities in a Linked Data World - Semnantic AnnotationsDigital Humanities in a Linked Data World - Semnantic Annotations
Digital Humanities in a Linked Data World - Semnantic Annotations
 
Usp dh 2013
Usp dh 2013Usp dh 2013
Usp dh 2013
 
Digital Methods by Richard Rogers
Digital Methods by Richard RogersDigital Methods by Richard Rogers
Digital Methods by Richard Rogers
 
Quora ML Workshop: Engineering at the Intersection of Productive Efficiency, ...
Quora ML Workshop: Engineering at the Intersection of Productive Efficiency, ...Quora ML Workshop: Engineering at the Intersection of Productive Efficiency, ...
Quora ML Workshop: Engineering at the Intersection of Productive Efficiency, ...
 
Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)Peer Learning via Dialogue with a Pattern Language ((COINs17)
Peer Learning via Dialogue with a Pattern Language ((COINs17)
 
Web Observatories and e-Research
Web Observatories and e-ResearchWeb Observatories and e-Research
Web Observatories and e-Research
 
humaniki User Research Report
humaniki User Research Report humaniki User Research Report
humaniki User Research Report
 
BioWikis BSB10
BioWikis BSB10BioWikis BSB10
BioWikis BSB10
 

More from Digital Methods Initiative

Query Design for Digital Methods by Richard Rogers
Query Design for Digital Methods by Richard RogersQuery Design for Digital Methods by Richard Rogers
Query Design for Digital Methods by Richard RogersDigital Methods Initiative
 
Digital Methods Summer School 2013 Tool Medley
Digital Methods Summer School 2013 Tool MedleyDigital Methods Summer School 2013 Tool Medley
Digital Methods Summer School 2013 Tool MedleyDigital Methods Initiative
 
Digital Methods Tool Medley. Digital Methods Summer School 2012
Digital Methods Tool Medley. Digital Methods Summer School 2012Digital Methods Tool Medley. Digital Methods Summer School 2012
Digital Methods Tool Medley. Digital Methods Summer School 2012Digital Methods Initiative
 
Digital Methods Winterschool 2012: API - Interfaces to the Cloud
Digital Methods Winterschool 2012: API - Interfaces to the CloudDigital Methods Winterschool 2012: API - Interfaces to the Cloud
Digital Methods Winterschool 2012: API - Interfaces to the CloudDigital Methods Initiative
 
DMI Workshop: Data visualization. Analytical clouding.
DMI Workshop: Data visualization. Analytical clouding.DMI Workshop: Data visualization. Analytical clouding.
DMI Workshop: Data visualization. Analytical clouding.Digital Methods Initiative
 
DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...
DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...
DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...Digital Methods Initiative
 

More from Digital Methods Initiative (11)

Query Design for Digital Methods by Richard Rogers
Query Design for Digital Methods by Richard RogersQuery Design for Digital Methods by Richard Rogers
Query Design for Digital Methods by Richard Rogers
 
Digital Methods Summer School 2013 Tool Medley
Digital Methods Summer School 2013 Tool MedleyDigital Methods Summer School 2013 Tool Medley
Digital Methods Summer School 2013 Tool Medley
 
Dmi12 workshops - crawling and scraping
Dmi12   workshops - crawling and scrapingDmi12   workshops - crawling and scraping
Dmi12 workshops - crawling and scraping
 
Digital Methods Tool Medley. Digital Methods Summer School 2012
Digital Methods Tool Medley. Digital Methods Summer School 2012Digital Methods Tool Medley. Digital Methods Summer School 2012
Digital Methods Tool Medley. Digital Methods Summer School 2012
 
Digital Methods Winterschool 2012: API - Interfaces to the Cloud
Digital Methods Winterschool 2012: API - Interfaces to the CloudDigital Methods Winterschool 2012: API - Interfaces to the Cloud
Digital Methods Winterschool 2012: API - Interfaces to the Cloud
 
DMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes ResearchDMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes Research
 
DMI Workshop: Crawling and Scraping
DMI Workshop: Crawling and Scraping DMI Workshop: Crawling and Scraping
DMI Workshop: Crawling and Scraping
 
DMI Workshop: Data visualization. Analytical clouding.
DMI Workshop: Data visualization. Analytical clouding.DMI Workshop: Data visualization. Analytical clouding.
DMI Workshop: Data visualization. Analytical clouding.
 
DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...
DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...
DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...
 
DMI Workshop. Data visualization: Clouding
DMI Workshop. Data visualization: CloudingDMI Workshop. Data visualization: Clouding
DMI Workshop. Data visualization: Clouding
 
IIPC Dutch Blogosphere
IIPC Dutch BlogosphereIIPC Dutch Blogosphere
IIPC Dutch Blogosphere
 

Recently uploaded

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 

Recently uploaded (20)

Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 

Repurposing Wikipedia: Wikipedia as data set and analytical device

  • 1. wikipedia as a data set & analytical device Erik Borra, June 2013
  • 2. DMI research with Wikipedia reference work bureaucracy scandal machine repurposing Wikipedia as a ... vigilant community
  • 3. Medium-specific outlook “follow the medium” (Rogers 2009)
  • 4. The Anatomy of a Wikipedia page
  • 5. DMI research with Wikipedia reference work bureaucracy scandal machine repurposing Wikipedia as a ... vigilant community
  • 6. DMI research with Wikipedia repurposing Wikipedia as a ... reference work <=> cultural reference bureaucracy scandal machine vigilant community
  • 8.
  • 9. Burial of 465 identified Bosniaks, Potočari, 2007. Map of the Srebrenica military operations, made by the U.S. Central Intelligence Agency, with green arrow showing the route of the Bosnian forces. Map of the location of Srebrenica, the Republika Srpska, Bosnia-Herzegovina. Srebrenica-Potočari Memorial and Cemetery, Bosnia-Herzegovina. Grave of a 13-year old Bosniak boy. Ratko Mladic. An exhumed body with blindfold and hands tied behind his back. As of September 2012, the photo has been removed from Wikipedia article. Exhumed grave of victims, 2007. Podrinje Identification Project's facility for storing, processing, and handling exhumed remains.. "UN left 8,000 to die in Bosnia." Headline in The Independent, 30 October 1995. Satellite photo of Nova Kasaba mass grave. International Criminal Tribunal for the Former Yugoslavia, Den Haag, the Netherlands. DUTCH ENGLISH BOSNIAN CROATIAN SERBIAN SERBO- CROATIAN Tool: Wikipedia Cross-Lingual Image Analysis
  • 11. National Point of View Neutral Point of View Linguistic Point of View manypedia.com - comparing linguistic points of view (LPOV) omnipedia.northwestern.edu - making Wikipedia articles in different languages comparable
  • 12.
  • 13.
  • 14. DMI research with Wikipedia repurposing Wikipedia as a ... reference work bureaucracy scandal machine vigilant community <=> socio-technical device
  • 15. Wikipedia has been described in terms of open source intelligence(Stalder and Hirsh, 2002), wisdom of crowds(Surowiecki, 2004; Kittur and Kraut, 2008), many minds(Sunstein, 2006), collaborative knowledge(Poe, 2006, McKenzie Wark 2007), an army of volunteers(Jenkins, 2006), mass collaboration(Tapscott, 2007), distributed collaboration(Shirky, 2008), produsage(Bruns, 2008), crowdsourcing(Economist, 2008), and mentioned in the context of free labour(Deuze, 2006), and the cult of the amateur(Keen, 2007).
  • 16. Tool: Wikipedia Edits Scraper and IP Localizer + screen recording software
  • 17.
  • 18. Govcom.org, 2008. http://govcom.org/ Analysis by Zachary Devereaux, Sabine Niederer, Richard Rogers, Bram Nijhof and Michael Stevenson. © 2008 Wikipedia editing by bots and users: Overall percentage of Bot activity of all edit activity________________________________ 08August Wikipedia editing by bots and users: Overall percentage of Bot activity of all edit activity Visualization: Rosa Menkman. Percentage of bot edits All wikipedia edits
  • 19. Digital Methods Initiative, 2008. www.digitalmethods.net Analysis by Zachary Devereaux, Sabine Niederer, Richard Rogers, Bram Nijhof and Michael Stevenson. © 2008 Human, Bot and Software Assisted Activity on Top Twenty EN Wikipedia Articles_________________________________________ 0August Human, Bot and Software Assisted Activity on Top Twenty EN Wikipedia Articles 46444644 Software Assisted Edits 30673067 Bot Edits 251378251378 Human Edits Maps by: Rosa Menkman, Auke Touwslager and Marieke van Dijk
  • 20. Govcomorg Jubilee, 2008. www.govcom.org Analysis by Zachary Devereaux, Sabine Niederer, Richard Rogers, Bram Nijhof and Michael Stevenson Visualization by Auke Touwslager © 2008 Human, Bot and Software Assisted Activity on Top Twenty EN Wikipedia Articles_________________________________________ 0August Percentage of bot activity Language scaled by Wikipedia size Cornish Võro Manx Ladino Friulian Romansh Aromanian Sanskrit Upper Sorbian Corsican Scottish Gaelic Samogitian Chuvash Aragonese Occitan Breton Endangered Languages Revived Languages Hawaiian Cornish Manx West Frisian Belarusian Basque Galician Estonian Hebrew Czech Catalan Wikipedia Bot Activity in Endangered and Revived Languages What is the level of bot activity per language, looking at endangered and revived languages? stats.wikimedia.org + bubble lines
  • 21. Govcomorg Jubilee, 2008. www.govcom.org Analysis by Zachary Devereaux, Sabine Niederer, Richard Rogers, Bram Nijhof and Michael Stevenson Visualization by Auke Touwslager © 2008 Human, Bot and Software Assisted Activity on Top Twenty EN Wikipedia Articles_________________________________________ 0August Javanese Urdu Tamil Hindi Marathi Bengali Telugu Korean Vietnamese Arabic Chinese Russian Spanish Japanese Italian Portuguese German French English Percentage of bot activity Language scaled by Wikipedia size Wikipedia Bot Activity in Most-Used Languages Worldwide What is the level of bot activity per language, looking at top 20 of most-used languages? stats.wikimedia.org + Dorling
  • 22. DMI research with Wikipedia repurposing Wikipedia as a ... reference work bureaucracy scandal machine <=> place of edits vigilant community
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29. DMI research with Wikipedia repurposing Wikipedia as a ... reference work bureaucracy <=> controversy diagnostics machine scandal machine vigilant community
  • 30. Administrative apparatus: Wikipedia’s procedural policies, guidelines and essays ... with the purpose of reaching consensus
  • 32. Wikipedia collaboration and conflict studies: Cooperation and conflict (Viegas, 2004) Conflict and coordination (Kittur, 2007) Revert Graph (Suh et al, 2007) Mutual reverts (Brandes, 2007) Argument identification (Rad & Barosa, 2011) Edit wars (Sumi, 2011) Evolution of discussions (Kaltenbrunner & Laniado, 2012)
  • 33. medium-specific approach: repurpose how consensus principles (as method of the medium) act on the objects of the medium
  • 34.
  • 35.
  • 36.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44.
  • 45.
  • 47. DMI research with Wikipedia reference work <=> cultural reference bureaucracy <=> controversy diagnostics machine scandal machine <=> places of edits repurposing Wikipedia as a ... vigilant community <=> socio-technical system
  • 48. DMI projects mentioned R. Rogers, Digital Methods , Cambridge, MA: MIT Press, 2013. Chapter 5. https://wiki.digitalmethods.net/Dmi/DebottingWikipedia https://www.digitalmethods.net/Digitalmethods/TheNetworkedContent Contropedia, contropedia.net S. Niederer and J. van Dijck, " Wisdom of the Crowd or Technicity of Content? Wikipedia as Socio-technical System," New Media & Society, 12, 8, 2010, 1368-1387.
  • 50. Tools native to Wikipedia Page information Search revision history Contributors per article, ordered by number of edits Page view statistics stats.wikimedia.org - Data is published in exportable, computable format, eg csv. Stats look at all wikis hosted by the foundation, largest are the Wikipedias. User edits searches, for all the edits of a specific user Number of watchers
  • 51. Other useful tools manypedia.com - puts two language versions of an article side by side / computes concept similarity omnipedia.northwestern.edu - making Wikipedia articles in different languages comparable wikiscanner (defunct) history flow (defunct) http://vs.aka-online.de/cgi-bin/wppagehiststat.pl builds an edit history overview page for the article http://sonetlab.fbk.eu/wikitrip to see geographical statistics for anonymous edits and gender
  • 52. Academic literature on Wikipedia http://en.wikipedia.org/wiki/Wikipedia:Academic_studies_of_Wikipedia Okoli et al. (2012). The people's encyclopedia under the gaze of the sages: A systematic review of scholarly research on Wikipedia