SlideShare a Scribd company logo
1 of 57
Download to read offline
Uncovering political connections of firms using
machine learning methods
BURN meetup, 9th February 2016
» János Divényi @janosdivenyi « » Jenő Pál @paljenczy «
CEU Microdata
Ádám SzeidlMiklós Koren
CEU Microdata
Ádám SzeidlMiklós Koren
Political connections
and favoritism in
Hungary
Political connections matter
Political connections matter
Guess the color: left or right
Altus Zrt. Fittelina Kft. Mahír Zrt. Közgép Zrt.
Guess the color: left or right
Altus Zrt. Fittelina Kft. Mahír Zrt. Közgép Zrt.
Guess the color: left or right
Altus Zrt. Fittelina Kft. Mahír Zrt. Közgép Zrt.
Guess the color: left or right
How to automate the process
for each firm?
Framework
Information Decision rule
Information
Firm register Election data
Information
Firm register Election data
Information
Firm register Election data
~5M rows ~350k rows
Information
Firm register Election data
~5M rows ~350k rows
data.table
Decision rule
~1B rows ~1B rows
The firm is right
if there are
more right than left politicians
in the firm
Guess the color: left or right
Altus Zrt. Fittelina Kft.Mahír Zrt. Közgép Zrt.
Guess the color: left or right
Altus Zrt. Fittelina Kft. Mahír Zrt. Közgép Zrt.
firm1 firm6 firm11 firm16 firm21 firm26 firm31 firm36 firm41 firm46
firm2 firm7 firm12 firm17 firm22 firm27 firm32 firm37 firm42 firm47
firm3 firm8 firm13 firm18 firm23 firm28 firm33 firm38 firm43 firm48
firm4 firm9 firm14 firm19 firm24 firm29 firm34 firm39 firm44 firm49
firm5 firm10 firm15 firm20 firm25 firm30 firm35 firm40 firm45 firm50
Guess the color: left or right
firm1 firm6 firm11 firm16 firm21 firm26 firm31 firm36 firm41 firm46
firm2 firm7 firm12 firm17 firm22 firm27 firm32 firm37 firm42 firm47
firm3 firm8 firm13 firm18 firm23 firm28 firm33 firm38 firm43 firm48
firm4 firm9 firm14 firm19 firm24 firm29 firm34 firm39 firm44 firm49
firm5 firm10 firm15 firm20 firm25 firm30 firm35 firm40 firm45 firm50
Guess the color: left or right
firm1 firm6 firm11 firm16 firm21 firm26 firm31 firms36 firm41 firm46
firm2 firm7 firm12 firm17 firm22 firm27 firm32 firm37 firm42 firm47
firm3 firm8 firm13 firm18 firm23 firm28 firm33 firm38 firm43 firm48
firm4 firm9 firm14 firm19 firm24 firm29 firm34 firm39 firm44 firm49
firm5 firm10 firm15 firm20 firm25 firm30 firm35 firm40 firm45 firm50
Guess the color: left or right
Ferenc Gyurcsány
PM of left coalition
2004-2009
Ferenc Gyurcsány
PM of left coalition
2004-2009
Ferenc Gyurcsán
local representative at Nyíregyháza
1998
Framework
Information Decision rule
Improve data
What is the chance that
firm person & politician
is the same?
Improve data
What is the chance that
firm person & politician
is the same?
Probabilistic
coloring
Improve data
What is the chance that
firm person & politician
is the same?
69% left 31% other
Probabilistic
coloring
Decision rule
~1B rows ~1B rows
The firm is right
if the average right probability
is larger
than the average left probability
Guess the color: left or right
Altus Zrt. Fittelina Kft. Mahír Zrt. Közgép Zrt.
Guess the color: left or right
Altus Zrt. Fittelina Kft. Mahír Zrt. Közgép Zrt.
firm1 firm6 firm11 firm16 firm21 firm26 firm31 firm36 firm41 firm46
firm2 firm7 firm12 firm17 firm22 firm27 firm32 firm37 firm42 firm47
firm3 firm8 firm13 firm18 firm23 firm28 firm33 firm38 firm43 firm48
firm4 firm9 firm14 firm19 firm24 firm29 firm34 firm39 firm44 firm49
firm5 firm10 firm15 firm20 firm25 firm30 firm35 firm40 firm45 firm50
Guess the color: left or right
Framework
Information Decision rule
Improve information
Improve information
Improve information
Links: common ownership or location
Improve information
Oligarchopedia
Improve information
Improve information
Improve information
Improve information
igraph
Improve decision rule
use machine learning
instead of ad hoc algorithms
Improve decision rule
use machine learning
instead of ad hoc algorithms
need training data
Improve decision rule
Improve decision rule
Improve decision rule
one interface to many algorithms
streamlines the process of machine learning
parallel computation with reproducibility
Improve decision rule
caret
classification and
regression training
one interface to many algorithms
streamlines the process of machine learning
parallel computation with reproducibility
Improve decision rule
caret
classification and
regression training
doParallel
The train function
Parallel computation
Seeds for parallel stochastic models
firm1 firm6 firm11 firm16 firm21 firm26 firm31 firm36 firm41 firm46
firm2 firm7 firm12 firm17 firm22 firm27 firm32 firm37 firm42 firm47
firm3 firm8 firm13 firm18 firm23 firm28 firm33 firm38 firm43 firm48
firm4 firm9 firm14 firm19 firm24 firm29 firm34 firm39 firm44 firm49
firm5 firm10 firm15 firm20 firm25 firm30 firm35 firm40 firm45 firm50
Guess the color: left or right
iterative process
involving manipulation, visualization, modelling, etc
Takeaways
iterative process
involving manipulation, visualization, modelling, etc
data.table
Takeaways
igraph
ggplot2
caret
ROCR
doParallel
Miklós Koren, Ádám Szeidl, Márta Bisztray, Anna Csonka,
Krisztián Fekete, Attila Gáspár, Dániel Molnár, Gábor Nyéki,
Krisztina Orbán, Rita Pető, Balázs Reizer, Mátyás Steiner,
Bálint Szilágyi, Ferenc Szűcs, András Vereckei, Zsófia
Kőműves, Olivér Kiss, Dániel Pass, Dávid Popper and others...
Thanks for the attention

More Related Content

More from János Divényi

More from János Divényi (8)

Slide4 bme adat_2015
Slide4 bme adat_2015Slide4 bme adat_2015
Slide4 bme adat_2015
 
Slide3 bme adat_2015
Slide3 bme adat_2015Slide3 bme adat_2015
Slide3 bme adat_2015
 
Slide2 bme adat_2015
Slide2 bme adat_2015Slide2 bme adat_2015
Slide2 bme adat_2015
 
Slide1 bme adat_2015
Slide1 bme adat_2015Slide1 bme adat_2015
Slide1 bme adat_2015
 
Why code
Why codeWhy code
Why code
 
How to-code-r
How to-code-rHow to-code-r
How to-code-r
 
How to code?
How to code?How to code?
How to code?
 
Why code?
Why code?Why code?
Why code?
 

Recently uploaded

Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 

Recently uploaded (20)

Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 

Uncovering Political Connections of Firms Using Machine Learning