SlideShare a Scribd company logo
1 of 20
Download to read offline
place graphs
        are the new
        social graphs
       Matt Biddulph
       @mattb | matt@hackdiary.com

Every data scientist has their own favourite way of representing their data. For some people
it’s Excel, and they think in rows and columns. For others it’s matrices, and they use linear
algreba to interrogate their data. For me, it’s graphs.
We’re all pretty used to the idea that you can model human relationships in a social graph.
“Social network analysis
        views social relationships in
        terms of network theory
        consisting of nodes and ties.
        Nodes are the individual actors
        within the networks, and ties
        are the relationships between
        the actors.”

There’s a pretty deep area of mathematical study called Social Network Analysis that goes
back at least 20 years. It tries to create insight by analysing the structure of social networks,
and usually doesn’t incorporate any elements of culture or sociology in doing so.
Centrality
                                                               measures




It led to the creation of techniques like centrality measures, that try to find the nodes that are
most central to the network. These might be the kind of people on Twitter who have the
highest chance of being retweeted.
Community
                                                              detection




There are also community detection algorithms that try to find the most tightly-knit
subgraphs and cluster those nodes together. If you ran this over the network of people I
follow on Twitter, it might be able to pick out my work colleagues or the people I socialise
with face-to-face.
People you
                                                            may know




Sites like LinkedIn build almost-telepathic “people you may know” features by walking around
the graph starting at your node and looking for people that show up a lot in your
neighbourhood that you haven’t connected with yet.
But enough mathematics. Let’s talk about Belgium.
Belgium is a country in the northwest of Europe with some unusual cultural qualities. It’s
sandwiched between the Netherlands and France. About half of the country speaks French,
and the other half speaks Dutch. It’d be very interesting to study the patterns of interactions
in this country.
Researchers at Louvain in Belgium were lucky enough to do a joint project with a Belgian
mobile phone company. They had access to anonymised records of 2.6 million phone calls -
the record of which phone called which number when.

http://arxiv.org/pdf/0802.2178v2
Belgian
                                                                        phonecall
                                                                        network




Fast unfolding of communities in large networks, Blondel et al [2008]

They used these calls to construct a “call graph”. They were able to develop a community-
detection algorithm that could detect the two separate clusters of Dutch and French speakers
that were mostly only calling each other. The algorithm achieved this simply by analysing the
shape of the graph. It knew nothing about French, Dutch or phone calls.

http://arxiv.org/pdf/0803.0476
So let’s take a step back and think about what other kinds of graph we could form, from what
kinds of data.
I work in location apps at Nokia, and so I naturally think of places. Wouldn’t it be interesting
to study the connections between cities instead of people? For example, people probably fly
more often between NYC and LA than they do between NYC and New Jersey. We could re-
draw the map based on closeness in the travel network.
I turned to the Hadoop cluster at Nokia and took a sample of several weeks of logs from our
routing servers. These are used every time someone uses our maps application to request a
driving route from one place to another. Every time someone drove from A to B, I made an
edge in a “place graph” from A to B.
I ran the data through Gephi and asked it to cluster it based on the strength of connections
between towns. The result is a not-quite-geographic new map of the world, where two cities
are close to each other if people often drive between them.
UK

                                                            China
                                                               Korea,
                                                             Japan, etc



                Spain                           Most of Europe




                                                                             India
                                                                             Pakistan
             Finland                     Russia

As you’d expect, the UK is an island and so people don’t drive in and out of it very often.
Spain and Portugal are not islands, but they appear separate because they’re attached to the
rest of Europe by a very narrow neck of land. So people are much more likely to fly than drive
out of Spain.
How could we use this data in a practical application? Say I’m coming to New York to attend a
conference on big data. I could choose a hotel near the conference venue, but I’d rather see
more interesting parts of New York.
Where should
                                                         I stay?




If I’ve never been to New York before, I could ask a friend. I could tell them that I like
London’s West End and San Francisco’s downtown.
Times Square = Piccadilly Circus
          New York                London
If they know both towns, they’d probably tell me that Times Square is the Piccadilly Circus of
New York.
What is the Greenwich Village
       of Tokyo?
       ... the Noe Valley of New York?
       ... the Shibuya of Los Angeles?



But if we delve into the place graph, we could answer much more interesting questions, and
create a “neighbourhood isomorphism” from city to city. People who like the Mission in SF
and Shoreditch in London could find out that Williamsberg is probably the best place for
them to stay in New York.
Thank you.
Matt Biddulph
@mattb | matt@hackdiary.com

More Related Content

What's hot

Avoiding the Heron's Way
Avoiding the Heron's WayAvoiding the Heron's Way
Avoiding the Heron's WayDorothea Salo
 
Webstock 2013: An Animating Spark
Webstock 2013: An Animating SparkWebstock 2013: An Animating Spark
Webstock 2013: An Animating SparkTom Coates
 
Property-based Access of RDF Data
Property-based Access of RDF DataProperty-based Access of RDF Data
Property-based Access of RDF DataGerd Groener
 
Data Journalism Workshop - Prague
Data Journalism Workshop - PragueData Journalism Workshop - Prague
Data Journalism Workshop - PragueChristian Kreutz
 
Isle of Man open data overview
Isle of Man open data overviewIsle of Man open data overview
Isle of Man open data overviewChris Taggart
 
Social Networks and the Semantic Web: a retrospective of the past 10 years
Social Networks and the Semantic Web: a retrospective of the past 10 yearsSocial Networks and the Semantic Web: a retrospective of the past 10 years
Social Networks and the Semantic Web: a retrospective of the past 10 yearsPeter Mika
 
Visualizing Networks
Visualizing NetworksVisualizing Networks
Visualizing Networksfreshdatabos
 
Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lora Aroyo
 
BISG WEBCAST -- Between Hype and Hope
BISG WEBCAST -- Between Hype and HopeBISG WEBCAST -- Between Hype and Hope
BISG WEBCAST -- Between Hype and Hopebisg
 
The Social Semantic Web
The Social Semantic WebThe Social Semantic Web
The Social Semantic WebJohn Breslin
 
Human-machine Inter-agencies
Human-machine Inter-agenciesHuman-machine Inter-agencies
Human-machine Inter-agenciesmo-seph
 

What's hot (14)

Avoiding the Heron's Way
Avoiding the Heron's WayAvoiding the Heron's Way
Avoiding the Heron's Way
 
Webstock 2013: An Animating Spark
Webstock 2013: An Animating SparkWebstock 2013: An Animating Spark
Webstock 2013: An Animating Spark
 
Property-based Access of RDF Data
Property-based Access of RDF DataProperty-based Access of RDF Data
Property-based Access of RDF Data
 
Data Journalism Workshop - Prague
Data Journalism Workshop - PragueData Journalism Workshop - Prague
Data Journalism Workshop - Prague
 
Urban lenses
Urban lensesUrban lenses
Urban lenses
 
Cyborgs
CyborgsCyborgs
Cyborgs
 
Isle of Man open data overview
Isle of Man open data overviewIsle of Man open data overview
Isle of Man open data overview
 
Social Networks and the Semantic Web: a retrospective of the past 10 years
Social Networks and the Semantic Web: a retrospective of the past 10 yearsSocial Networks and the Semantic Web: a retrospective of the past 10 years
Social Networks and the Semantic Web: a retrospective of the past 10 years
 
Visualizing Networks
Visualizing NetworksVisualizing Networks
Visualizing Networks
 
Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)Lecture 7: How to STUDY the Social Web? (2014)
Lecture 7: How to STUDY the Social Web? (2014)
 
2014_WWW_BTOR
2014_WWW_BTOR2014_WWW_BTOR
2014_WWW_BTOR
 
BISG WEBCAST -- Between Hype and Hope
BISG WEBCAST -- Between Hype and HopeBISG WEBCAST -- Between Hype and Hope
BISG WEBCAST -- Between Hype and Hope
 
The Social Semantic Web
The Social Semantic WebThe Social Semantic Web
The Social Semantic Web
 
Human-machine Inter-agencies
Human-machine Inter-agenciesHuman-machine Inter-agencies
Human-machine Inter-agencies
 

Viewers also liked

How To Write Your First Firefox Extension
How To Write Your First Firefox ExtensionHow To Write Your First Firefox Extension
How To Write Your First Firefox ExtensionRobert Nyman
 
iPhone Coding For Web Developers
iPhone Coding For Web DevelopersiPhone Coding For Web Developers
iPhone Coding For Web DevelopersMatt Biddulph
 
Latin America & Caribbean Regional Outlook June 2013
Latin America & Caribbean Regional Outlook June 2013Latin America & Caribbean Regional Outlook June 2013
Latin America & Caribbean Regional Outlook June 2013WB_Research
 
South Asia Regional Outlook June 2013
South Asia Regional Outlook  June 2013South Asia Regional Outlook  June 2013
South Asia Regional Outlook June 2013WB_Research
 
How to do presentations that don't induce suicide
How to do presentations that don't induce suicideHow to do presentations that don't induce suicide
How to do presentations that don't induce suicideAndy Whitlock
 
Science of Delivery in Education
Science of Delivery in EducationScience of Delivery in Education
Science of Delivery in EducationWB_Research
 
Middle East & North Africa Regional Outlook June 2013
Middle East & North Africa Regional Outlook June 2013Middle East & North Africa Regional Outlook June 2013
Middle East & North Africa Regional Outlook June 2013WB_Research
 
Gep june 2013 regional outlook east asia & pacific
Gep june 2013 regional outlook east asia & pacificGep june 2013 regional outlook east asia & pacific
Gep june 2013 regional outlook east asia & pacificWB_Research
 
Rural-Urban Dynamics and the MDGs
Rural-Urban Dynamics and the MDGsRural-Urban Dynamics and the MDGs
Rural-Urban Dynamics and the MDGsWB_Research
 
Dagstuhl FOAF history talk
Dagstuhl FOAF history talkDagstuhl FOAF history talk
Dagstuhl FOAF history talkDan Brickley
 
Global Economic Prospects Jan 2013
Global Economic Prospects Jan 2013Global Economic Prospects Jan 2013
Global Economic Prospects Jan 2013WB_Research
 

Viewers also liked (20)

How To Write Your First Firefox Extension
How To Write Your First Firefox ExtensionHow To Write Your First Firefox Extension
How To Write Your First Firefox Extension
 
iPhone Coding For Web Developers
iPhone Coding For Web DevelopersiPhone Coding For Web Developers
iPhone Coding For Web Developers
 
Latin America & Caribbean Regional Outlook June 2013
Latin America & Caribbean Regional Outlook June 2013Latin America & Caribbean Regional Outlook June 2013
Latin America & Caribbean Regional Outlook June 2013
 
South Asia Regional Outlook June 2013
South Asia Regional Outlook  June 2013South Asia Regional Outlook  June 2013
South Asia Regional Outlook June 2013
 
How to do presentations that don't induce suicide
How to do presentations that don't induce suicideHow to do presentations that don't induce suicide
How to do presentations that don't induce suicide
 
Quality ed (1)
Quality ed (1)Quality ed (1)
Quality ed (1)
 
Secondary ed
Secondary edSecondary ed
Secondary ed
 
Tertiary ed
Tertiary edTertiary ed
Tertiary ed
 
Literacy ed
Literacy edLiteracy ed
Literacy ed
 
Science of Delivery in Education
Science of Delivery in EducationScience of Delivery in Education
Science of Delivery in Education
 
Middle East & North Africa Regional Outlook June 2013
Middle East & North Africa Regional Outlook June 2013Middle East & North Africa Regional Outlook June 2013
Middle East & North Africa Regional Outlook June 2013
 
Gep june 2013 regional outlook east asia & pacific
Gep june 2013 regional outlook east asia & pacificGep june 2013 regional outlook east asia & pacific
Gep june 2013 regional outlook east asia & pacific
 
Equity ed
Equity edEquity ed
Equity ed
 
Primary ed
Primary edPrimary ed
Primary ed
 
Gender ed
Gender edGender ed
Gender ed
 
Rural-Urban Dynamics and the MDGs
Rural-Urban Dynamics and the MDGsRural-Urban Dynamics and the MDGs
Rural-Urban Dynamics and the MDGs
 
Pre primaryed
Pre primaryedPre primaryed
Pre primaryed
 
Dagstuhl FOAF history talk
Dagstuhl FOAF history talkDagstuhl FOAF history talk
Dagstuhl FOAF history talk
 
Has Off-Grid Electrification Come of Age?
Has Off-Grid Electrification Come of Age? Has Off-Grid Electrification Come of Age?
Has Off-Grid Electrification Come of Age?
 
Global Economic Prospects Jan 2013
Global Economic Prospects Jan 2013Global Economic Prospects Jan 2013
Global Economic Prospects Jan 2013
 

Similar to Place graphs are the new social graphs

Cognitive Cities: City analytics
Cognitive Cities: City analyticsCognitive Cities: City analytics
Cognitive Cities: City analyticsMatt Biddulph
 
Network Mapping & Data Storytelling for Beginners
Network Mapping & Data Storytelling for BeginnersNetwork Mapping & Data Storytelling for Beginners
Network Mapping & Data Storytelling for BeginnersRenaud Clément
 
Harvesting collective intelligence.
Harvesting collective intelligence. Harvesting collective intelligence.
Harvesting collective intelligence. Alberto Cottica
 
Critical Network Mapping, Burak Arikan talk at Eyeo2014, Minneapolis
Critical Network Mapping, Burak Arikan talk at Eyeo2014, MinneapolisCritical Network Mapping, Burak Arikan talk at Eyeo2014, Minneapolis
Critical Network Mapping, Burak Arikan talk at Eyeo2014, MinneapolisBurak Arikan
 
Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3SMCFrance
 
Searching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSilvia Puglisi
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Doug Needham
 
Placebooks
PlacebooksPlacebooks
PlacebooksTom Pert
 
Forty Years of the OTA
Forty Years of the OTAForty Years of the OTA
Forty Years of the OTAMartin Wynne
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchFrancesca Di Donato
 
10 Jahre Web Science
10 Jahre Web Science10 Jahre Web Science
10 Jahre Web ScienceSteffen Staab
 
ViziCities - Lessons Learnt Visualising Real-world Cities in 3D
ViziCities - Lessons Learnt Visualising Real-world Cities in 3DViziCities - Lessons Learnt Visualising Real-world Cities in 3D
ViziCities - Lessons Learnt Visualising Real-world Cities in 3DRobin Hawkes
 
杭州讲座 石田英敬
杭州讲座 石田英敬杭州讲座 石田英敬
杭州讲座 石田英敬luruiyang
 
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...Jonathan Stray
 
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!Tish Shute
 
Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1Lauri Eloranta
 
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc SmithMarc Smith
 

Similar to Place graphs are the new social graphs (20)

Cognitive Cities: City analytics
Cognitive Cities: City analyticsCognitive Cities: City analytics
Cognitive Cities: City analytics
 
Network Mapping & Data Storytelling for Beginners
Network Mapping & Data Storytelling for BeginnersNetwork Mapping & Data Storytelling for Beginners
Network Mapping & Data Storytelling for Beginners
 
Harvesting collective intelligence.
Harvesting collective intelligence. Harvesting collective intelligence.
Harvesting collective intelligence.
 
Critical Network Mapping, Burak Arikan talk at Eyeo2014, Minneapolis
Critical Network Mapping, Burak Arikan talk at Eyeo2014, MinneapolisCritical Network Mapping, Burak Arikan talk at Eyeo2014, Minneapolis
Critical Network Mapping, Burak Arikan talk at Eyeo2014, Minneapolis
 
AHRC CDP Digital Humanities 101
AHRC CDP Digital Humanities 101  AHRC CDP Digital Humanities 101
AHRC CDP Digital Humanities 101
 
Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3Franck Rebillard, Professeur Université Paris 3
Franck Rebillard, Professeur Université Paris 3
 
Searching for patterns in crowdsourced information
Searching for patterns in crowdsourced informationSearching for patterns in crowdsourced information
Searching for patterns in crowdsourced information
 
Oulu2
Oulu2Oulu2
Oulu2
 
Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview. Social Network Analysis Introduction including Data Structure Graph overview.
Social Network Analysis Introduction including Data Structure Graph overview.
 
Placebooks
PlacebooksPlacebooks
Placebooks
 
Forty Years of the OTA
Forty Years of the OTAForty Years of the OTA
Forty Years of the OTA
 
Everyday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for researchEveryday digital scholarship: Using web-based tools for research
Everyday digital scholarship: Using web-based tools for research
 
10 Jahre Web Science
10 Jahre Web Science10 Jahre Web Science
10 Jahre Web Science
 
ViziCities - Lessons Learnt Visualising Real-world Cities in 3D
ViziCities - Lessons Learnt Visualising Real-world Cities in 3DViziCities - Lessons Learnt Visualising Real-world Cities in 3D
ViziCities - Lessons Learnt Visualising Real-world Cities in 3D
 
杭州讲座 石田英敬
杭州讲座 石田英敬杭州讲座 石田英敬
杭州讲座 石田英敬
 
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
 
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!
The Next Wave of AR: Mobile Social Interaction Right Here, Right Now!
 
Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1
 
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
 
Dh presentation 2018
Dh presentation 2018Dh presentation 2018
Dh presentation 2018
 

More from Matt Biddulph

The IoT Conversation
The IoT ConversationThe IoT Conversation
The IoT ConversationMatt Biddulph
 
EAN’s World of Data: Prototyping apps using real data
EAN’s World of Data: Prototyping apps using real dataEAN’s World of Data: Prototyping apps using real data
EAN’s World of Data: Prototyping apps using real dataMatt Biddulph
 
Monkigras 2012: Networks Of Data
Monkigras 2012: Networks Of DataMonkigras 2012: Networks Of Data
Monkigras 2012: Networks Of DataMatt Biddulph
 
Science Hackday: using visualisation to understand your data
Science Hackday: using visualisation to understand your dataScience Hackday: using visualisation to understand your data
Science Hackday: using visualisation to understand your dataMatt Biddulph
 
Prototyping with data at Nokia
Prototyping with data at NokiaPrototyping with data at Nokia
Prototyping with data at NokiaMatt Biddulph
 
Mobile Social Location (Web Directions @media version)
Mobile Social Location (Web Directions @media version)Mobile Social Location (Web Directions @media version)
Mobile Social Location (Web Directions @media version)Matt Biddulph
 
Tinkering with game controllers
Tinkering with game controllersTinkering with game controllers
Tinkering with game controllersMatt Biddulph
 
SXSW 2008: Creative Collaboration
SXSW 2008: Creative CollaborationSXSW 2008: Creative Collaboration
SXSW 2008: Creative CollaborationMatt Biddulph
 
Coding on the Shoulders of Giants
Coding on the Shoulders of GiantsCoding on the Shoulders of Giants
Coding on the Shoulders of GiantsMatt Biddulph
 
Connecting First And Second Life
Connecting First And Second LifeConnecting First And Second Life
Connecting First And Second LifeMatt Biddulph
 
Coders need to learn hardware hacking NOW
Coders need to learn hardware hacking NOWCoders need to learn hardware hacking NOW
Coders need to learn hardware hacking NOWMatt Biddulph
 

More from Matt Biddulph (12)

The IoT Conversation
The IoT ConversationThe IoT Conversation
The IoT Conversation
 
EAN’s World of Data: Prototyping apps using real data
EAN’s World of Data: Prototyping apps using real dataEAN’s World of Data: Prototyping apps using real data
EAN’s World of Data: Prototyping apps using real data
 
Monkigras 2012: Networks Of Data
Monkigras 2012: Networks Of DataMonkigras 2012: Networks Of Data
Monkigras 2012: Networks Of Data
 
Science Hackday: using visualisation to understand your data
Science Hackday: using visualisation to understand your dataScience Hackday: using visualisation to understand your data
Science Hackday: using visualisation to understand your data
 
Prototyping with data at Nokia
Prototyping with data at NokiaPrototyping with data at Nokia
Prototyping with data at Nokia
 
Mobile Social Location (Web Directions @media version)
Mobile Social Location (Web Directions @media version)Mobile Social Location (Web Directions @media version)
Mobile Social Location (Web Directions @media version)
 
Tinkering with game controllers
Tinkering with game controllersTinkering with game controllers
Tinkering with game controllers
 
The Realtime Web
The Realtime WebThe Realtime Web
The Realtime Web
 
SXSW 2008: Creative Collaboration
SXSW 2008: Creative CollaborationSXSW 2008: Creative Collaboration
SXSW 2008: Creative Collaboration
 
Coding on the Shoulders of Giants
Coding on the Shoulders of GiantsCoding on the Shoulders of Giants
Coding on the Shoulders of Giants
 
Connecting First And Second Life
Connecting First And Second LifeConnecting First And Second Life
Connecting First And Second Life
 
Coders need to learn hardware hacking NOW
Coders need to learn hardware hacking NOWCoders need to learn hardware hacking NOW
Coders need to learn hardware hacking NOW
 

Recently uploaded

UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesBernd Ruecker
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 

Recently uploaded (20)

UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
QCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architecturesQCon London: Mastering long-running processes in modern architectures
QCon London: Mastering long-running processes in modern architectures
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 

Place graphs are the new social graphs

  • 1. place graphs are the new social graphs Matt Biddulph @mattb | matt@hackdiary.com Every data scientist has their own favourite way of representing their data. For some people it’s Excel, and they think in rows and columns. For others it’s matrices, and they use linear algreba to interrogate their data. For me, it’s graphs.
  • 2. We’re all pretty used to the idea that you can model human relationships in a social graph.
  • 3. “Social network analysis views social relationships in terms of network theory consisting of nodes and ties. Nodes are the individual actors within the networks, and ties are the relationships between the actors.” There’s a pretty deep area of mathematical study called Social Network Analysis that goes back at least 20 years. It tries to create insight by analysing the structure of social networks, and usually doesn’t incorporate any elements of culture or sociology in doing so.
  • 4. Centrality measures It led to the creation of techniques like centrality measures, that try to find the nodes that are most central to the network. These might be the kind of people on Twitter who have the highest chance of being retweeted.
  • 5. Community detection There are also community detection algorithms that try to find the most tightly-knit subgraphs and cluster those nodes together. If you ran this over the network of people I follow on Twitter, it might be able to pick out my work colleagues or the people I socialise with face-to-face.
  • 6. People you may know Sites like LinkedIn build almost-telepathic “people you may know” features by walking around the graph starting at your node and looking for people that show up a lot in your neighbourhood that you haven’t connected with yet.
  • 7. But enough mathematics. Let’s talk about Belgium.
  • 8. Belgium is a country in the northwest of Europe with some unusual cultural qualities. It’s sandwiched between the Netherlands and France. About half of the country speaks French, and the other half speaks Dutch. It’d be very interesting to study the patterns of interactions in this country.
  • 9. Researchers at Louvain in Belgium were lucky enough to do a joint project with a Belgian mobile phone company. They had access to anonymised records of 2.6 million phone calls - the record of which phone called which number when. http://arxiv.org/pdf/0802.2178v2
  • 10. Belgian phonecall network Fast unfolding of communities in large networks, Blondel et al [2008] They used these calls to construct a “call graph”. They were able to develop a community- detection algorithm that could detect the two separate clusters of Dutch and French speakers that were mostly only calling each other. The algorithm achieved this simply by analysing the shape of the graph. It knew nothing about French, Dutch or phone calls. http://arxiv.org/pdf/0803.0476
  • 11. So let’s take a step back and think about what other kinds of graph we could form, from what kinds of data.
  • 12. I work in location apps at Nokia, and so I naturally think of places. Wouldn’t it be interesting to study the connections between cities instead of people? For example, people probably fly more often between NYC and LA than they do between NYC and New Jersey. We could re- draw the map based on closeness in the travel network.
  • 13. I turned to the Hadoop cluster at Nokia and took a sample of several weeks of logs from our routing servers. These are used every time someone uses our maps application to request a driving route from one place to another. Every time someone drove from A to B, I made an edge in a “place graph” from A to B.
  • 14. I ran the data through Gephi and asked it to cluster it based on the strength of connections between towns. The result is a not-quite-geographic new map of the world, where two cities are close to each other if people often drive between them.
  • 15. UK China Korea, Japan, etc Spain Most of Europe India Pakistan Finland Russia As you’d expect, the UK is an island and so people don’t drive in and out of it very often. Spain and Portugal are not islands, but they appear separate because they’re attached to the rest of Europe by a very narrow neck of land. So people are much more likely to fly than drive out of Spain.
  • 16. How could we use this data in a practical application? Say I’m coming to New York to attend a conference on big data. I could choose a hotel near the conference venue, but I’d rather see more interesting parts of New York.
  • 17. Where should I stay? If I’ve never been to New York before, I could ask a friend. I could tell them that I like London’s West End and San Francisco’s downtown.
  • 18. Times Square = Piccadilly Circus New York London If they know both towns, they’d probably tell me that Times Square is the Piccadilly Circus of New York.
  • 19. What is the Greenwich Village of Tokyo? ... the Noe Valley of New York? ... the Shibuya of Los Angeles? But if we delve into the place graph, we could answer much more interesting questions, and create a “neighbourhood isomorphism” from city to city. People who like the Mission in SF and Shoreditch in London could find out that Williamsberg is probably the best place for them to stay in New York.
  • 20. Thank you. Matt Biddulph @mattb | matt@hackdiary.com