SlideShare a Scribd company logo
1 of 21
Download to read offline
Arizona State University
Crowdsourcing via Tensor Augmentation
and Completion (TAC)
Presenter: Yao Zhou joint work with: Dr. Jingrui He
- 1 -
Arizona State University Arizona State University
Arizona State University- 2 -
Roadmap
 Background
 Related work
 Crowdsourcing based on TAC
 Experimental results
 Conclusion
Arizona State University- 3 -
Crowdsourcing in machine learning
 Training a supervised machine learning model needs
training labels
 Many crowdsourcing platforms provide services to collect
labels information.
Arizona State University- 4 -
An example of crowdsourcing
Lynx (wildcat)
Tabby (domestic cat)
Arizona State University- 5 -
Key problem of crowdsourcing
 Cons:
 Low quality: Collected labels from the crowd (non-expert) are noisy.
 Missing labels: Some workers are not willing to label all of the items.
How to infer the true labels from a large number of labels
collected from crowd?
 Pros:
 Low cost: Collecting large amounts of labels is economic.
Noisy
labels
Missing
labels
Arizona State University- 6 -
Some related work
MV
 Majority Voting, a simple baseline.
DS-EM [Dawid and Skene, 1979]
 Infer worker’s ability matrix and true labels.
 Two-coin model for a binary labelling task.
GLAD [Whitehill et al., 2009].
 Infer the worker’s ability, item difficulty and item true labels simultaneously.
DS-MF [Liu et al., 2012].
 Employ variational Bayesian inference using meanfield algorithm.
MMCE [Zhou et al., 2012].
 Employ the minimax entropy principle to infer worker ability, item difficulty and
true labels at the same time.
Structural
information of labels
is not utilized !!
Arizona State University- 7 -
Roadmap
 Background
 Related work
 Crowdsourcing based on TAC
 Experimental results
 Conclusion
Arizona State University- 8 -
Tensor augmentation
 Notation:
 Re-organize labels of crowds as a three-way
tensor:
 Based on worker’s labelling decision,
generate an index set:
• Workers: i = 1,2, …, Nw
• Items: j = 1,2, …, Ni
• Classes: k = 1, …, Nc
Tensor T
# of items Ni
#ofworkersNw
Ground truth layer
+1
 The ground truth layer:
 Extra tensor slice of size 𝑁𝑖 × 𝑁𝑐.
 Augmented on tensor along the
worker dimension.
Arizona State University- 9 -
Tensor augmentation and completion (TAC)
 Main principle of TAC:
 Rank minimization
NP-hard
Tightest
convex
envelope
 Trace norm minimization
 Goal of TAC:
 Complete the augmented tensor
Arizona State University- 10 -
Tensor augmentation and completion (TAC)
 Definition of trace norm for an n-way tensor [Liu et.al 2012]:
Here, represents for the unfold of a tensor X. The reverse operation is fold.𝑋 𝑙
Tensor: 𝑋 ∈ 𝑅3×2×2
Unfolded matrices:
𝑋(1) ∈ 𝑅3×4
𝑋(2) ∈ 𝑅2×6
𝑋 3 ∈ 𝑅2×6
Reference: Ji Liu et.al. Tensor completion for estimating missing values in visual data. TPAMI 2012
Arizona State University- 11 -
Tensor augmentation and completion (TAC)
Relaxed objective of TAC with regularization:
Regularization
term
Index of the
ground truth layer
Intermediate
relaxed matrices
Solution:
 Block Coordinate Descend (BCD)
 Four blocks of variables:
Arizona State University- 12 -
Updating 𝑴𝒍
Sub-problem:
Closed form solution, proved by [Cai et.al. 2009]:
Here, and τ =
Reference: Jian-Feng Cai, et.al. A singular value thresholding algorithm for matrix completion. SIAM Journal on Optimization, 2010.
Singular Value
Thresholding
Arizona State University- 13 -
Updating X
Two formulations:
 Prior guided ground truth inference (PG-TAC)
Prior Statistics
 Relaxed simplex ground truth inference (RS-TAC)
Slack variable
Arizona State University- 14 -
Prior guided ground truth inference (PG-TAC)
 Solution:
Elements of tensor X can be divided into three sets {𝐶1, 𝐶2, 𝐶3}
Elements of set 𝑪 𝟏:
Elements of set 𝑪 𝟐:
Elements of set 𝑪 𝟑:
Updating X
Tensor T
Ground truth layer
Elements of
Set C3
Elements of
Set C1
Elements of
Set C2
Arizona State University- 15 -
Elements of set 𝑪 𝟑:
Updating X
Prior guided ground truth inference (RS-TAC)
 Solution:
Elements of tensor X can be divided into three sets {𝐶1, 𝐶2, 𝐶3}
Elements of set 𝑪 𝟏:
Elements of set 𝑪 𝟐:
Tensor T
Ground truth layer
Elements of
Set C3
Elements of
Set C1
Elements of
Set C2
Different
from PG-TAC
Arizona State University- 16 -
Roadmap
 Background
 Related work
 Crowdsourcing based on TAC
 Experimental results
 Conclusion
Arizona State University- 17 -
Experimental Results
Synthetic Data Set:
 Four configurations:
 Notations:
• # of Workers: Nw
• # of Items: Ni
• # of Classes: Nc
• Probability of not giving
labels q
Lower
is
better
Lower
is
better
 Initial configuration:
• Nw = 50, Ni = 400
• Nc = 4, q = 0.7
Arizona State University- 18 -
Experimental Results
Real-world Data Set:
References:
Dengyong Zhou, et.al. Learning from the wisdom of crowds by minimax entropy. NIPS, 2012.
Dengyong Zhou, et.al. Regularized minimax conditional entropy for crowdsourcing. CoRR, 2015.
Hu Han, et.al. Demographic estimation from face images: Human vs. machine performance. TPAMI, 2015.
Rion Snow, et.al. Cheap and fast—but is it good?: Evaluating non-expert annotations for natural language tasks. EMNLP, 2008.
Arizona State University- 19 -
Experimental Results
Real-world Data Set results:
References:
Qiang Liu, et al. Variational inference for crowdsourcing. NIPS, 2012.
Dengyong Zhou, et.al. Learning from the wisdom of crowds by minimax entropy. NIPS, 2012.
A. P. Dawid, et al. Maximum likelihood estimation of observer error-rates using the EM algorithm. Applied Statistics, 1979.
Jacob Whitehill et al. Whose vote should count more: Optimal integration of labels from labelers of unknown expertise. NIPS , 2009.
Arizona State University- 20 -
Conclusion
 Two novel methods PG-TAC and RS-TAC:
 Augment the data tensor with a ground truth layer.
 Utilize the structural information of crowd labels.
 Infer the true labels of items in binary and multi-class settings.
 Experimental results:
 Six real data sets.
 Outperform state-of-the-art methods.
Arizona State University- 21 -
Thank you !
&
Questions?

More Related Content

Similar to Crowdsourcing via Tensor Augmentation and Completion

Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...BigMine
 
韩国会议
韩国会议韩国会议
韩国会议YAO YUAN
 
Oversampling technique in student performance classification from engineering...
Oversampling technique in student performance classification from engineering...Oversampling technique in student performance classification from engineering...
Oversampling technique in student performance classification from engineering...IJECEIAES
 
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...sherinmm
 
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...sherinmm
 
Unlearn What You Have Learned: Adaptive Crowd Teaching with Exponentially Dec...
Unlearn What You Have Learned: Adaptive Crowd Teaching with Exponentially Dec...Unlearn What You Have Learned: Adaptive Crowd Teaching with Exponentially Dec...
Unlearn What You Have Learned: Adaptive Crowd Teaching with Exponentially Dec...collwe
 
Machine learning and Neural Networks
Machine learning and Neural NetworksMachine learning and Neural Networks
Machine learning and Neural Networksbutest
 
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESIMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESVikash Kumar
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Application of combined support vector machines in process fault diagnosis
Application of combined support vector machines in process fault diagnosisApplication of combined support vector machines in process fault diagnosis
Application of combined support vector machines in process fault diagnosisDr.Pooja Jain
 
Collaborative Filtering Survey
Collaborative Filtering SurveyCollaborative Filtering Survey
Collaborative Filtering Surveymobilizer1000
 
Morgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 distMorgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 distddm314
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfhemangppatel
 
More investment in Research and Development for better Education in the future?
More investment in Research and Development for better Education in the future?More investment in Research and Development for better Education in the future?
More investment in Research and Development for better Education in the future?Dhafer Malouche
 
L008.Eigenfaces And Nn Som
L008.Eigenfaces And Nn SomL008.Eigenfaces And Nn Som
L008.Eigenfaces And Nn Somramesh kumar
 
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...YONG ZHENG
 
Data.Mining.C.6(II).classification and prediction
Data.Mining.C.6(II).classification and predictionData.Mining.C.6(II).classification and prediction
Data.Mining.C.6(II).classification and predictionMargaret Wang
 
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
Machine learning by Dr. Vivek Vijay and Dr. Sandeep YadavMachine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
Machine learning by Dr. Vivek Vijay and Dr. Sandeep YadavAgile Testing Alliance
 

Similar to Crowdsourcing via Tensor Augmentation and Completion (20)

Data Mining.ppt
Data Mining.pptData Mining.ppt
Data Mining.ppt
 
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...
Inside the Atoms: Mining a Network of Networks and Beyond by HangHang Tong at...
 
韩国会议
韩国会议韩国会议
韩国会议
 
Oversampling technique in student performance classification from engineering...
Oversampling technique in student performance classification from engineering...Oversampling technique in student performance classification from engineering...
Oversampling technique in student performance classification from engineering...
 
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...
MAXIMUM CORRENTROPY BASED DICTIONARY LEARNING FOR PHYSICAL ACTIVITY RECOGNITI...
 
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...
Maximum Correntropy Based Dictionary Learning Framework for Physical Activity...
 
Unlearn What You Have Learned: Adaptive Crowd Teaching with Exponentially Dec...
Unlearn What You Have Learned: Adaptive Crowd Teaching with Exponentially Dec...Unlearn What You Have Learned: Adaptive Crowd Teaching with Exponentially Dec...
Unlearn What You Have Learned: Adaptive Crowd Teaching with Exponentially Dec...
 
Machine learning and Neural Networks
Machine learning and Neural NetworksMachine learning and Neural Networks
Machine learning and Neural Networks
 
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHESIMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
IMAGE CLASSIFICATION USING DIFFERENT CLASSICAL APPROACHES
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Application of combined support vector machines in process fault diagnosis
Application of combined support vector machines in process fault diagnosisApplication of combined support vector machines in process fault diagnosis
Application of combined support vector machines in process fault diagnosis
 
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATIONRESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
RESULT MINING: ANALYSIS OF DATA MINING TECHNIQUES IN EDUCATION
 
Collaborative Filtering Survey
Collaborative Filtering SurveyCollaborative Filtering Survey
Collaborative Filtering Survey
 
Morgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 distMorgan uw maGIV v1.3 dist
Morgan uw maGIV v1.3 dist
 
know Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdfknow Machine Learning Basic Concepts.pdf
know Machine Learning Basic Concepts.pdf
 
More investment in Research and Development for better Education in the future?
More investment in Research and Development for better Education in the future?More investment in Research and Development for better Education in the future?
More investment in Research and Development for better Education in the future?
 
L008.Eigenfaces And Nn Som
L008.Eigenfaces And Nn SomL008.Eigenfaces And Nn Som
L008.Eigenfaces And Nn Som
 
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
[RecSys 2014] Deviation-Based and Similarity-Based Contextual SLIM Recommenda...
 
Data.Mining.C.6(II).classification and prediction
Data.Mining.C.6(II).classification and predictionData.Mining.C.6(II).classification and prediction
Data.Mining.C.6(II).classification and prediction
 
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
Machine learning by Dr. Vivek Vijay and Dr. Sandeep YadavMachine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
 

Recently uploaded

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 

Recently uploaded (20)

GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 

Crowdsourcing via Tensor Augmentation and Completion

  • 1. Arizona State University Crowdsourcing via Tensor Augmentation and Completion (TAC) Presenter: Yao Zhou joint work with: Dr. Jingrui He - 1 - Arizona State University Arizona State University
  • 2. Arizona State University- 2 - Roadmap  Background  Related work  Crowdsourcing based on TAC  Experimental results  Conclusion
  • 3. Arizona State University- 3 - Crowdsourcing in machine learning  Training a supervised machine learning model needs training labels  Many crowdsourcing platforms provide services to collect labels information.
  • 4. Arizona State University- 4 - An example of crowdsourcing Lynx (wildcat) Tabby (domestic cat)
  • 5. Arizona State University- 5 - Key problem of crowdsourcing  Cons:  Low quality: Collected labels from the crowd (non-expert) are noisy.  Missing labels: Some workers are not willing to label all of the items. How to infer the true labels from a large number of labels collected from crowd?  Pros:  Low cost: Collecting large amounts of labels is economic. Noisy labels Missing labels
  • 6. Arizona State University- 6 - Some related work MV  Majority Voting, a simple baseline. DS-EM [Dawid and Skene, 1979]  Infer worker’s ability matrix and true labels.  Two-coin model for a binary labelling task. GLAD [Whitehill et al., 2009].  Infer the worker’s ability, item difficulty and item true labels simultaneously. DS-MF [Liu et al., 2012].  Employ variational Bayesian inference using meanfield algorithm. MMCE [Zhou et al., 2012].  Employ the minimax entropy principle to infer worker ability, item difficulty and true labels at the same time. Structural information of labels is not utilized !!
  • 7. Arizona State University- 7 - Roadmap  Background  Related work  Crowdsourcing based on TAC  Experimental results  Conclusion
  • 8. Arizona State University- 8 - Tensor augmentation  Notation:  Re-organize labels of crowds as a three-way tensor:  Based on worker’s labelling decision, generate an index set: • Workers: i = 1,2, …, Nw • Items: j = 1,2, …, Ni • Classes: k = 1, …, Nc Tensor T # of items Ni #ofworkersNw Ground truth layer +1  The ground truth layer:  Extra tensor slice of size 𝑁𝑖 × 𝑁𝑐.  Augmented on tensor along the worker dimension.
  • 9. Arizona State University- 9 - Tensor augmentation and completion (TAC)  Main principle of TAC:  Rank minimization NP-hard Tightest convex envelope  Trace norm minimization  Goal of TAC:  Complete the augmented tensor
  • 10. Arizona State University- 10 - Tensor augmentation and completion (TAC)  Definition of trace norm for an n-way tensor [Liu et.al 2012]: Here, represents for the unfold of a tensor X. The reverse operation is fold.𝑋 𝑙 Tensor: 𝑋 ∈ 𝑅3×2×2 Unfolded matrices: 𝑋(1) ∈ 𝑅3×4 𝑋(2) ∈ 𝑅2×6 𝑋 3 ∈ 𝑅2×6 Reference: Ji Liu et.al. Tensor completion for estimating missing values in visual data. TPAMI 2012
  • 11. Arizona State University- 11 - Tensor augmentation and completion (TAC) Relaxed objective of TAC with regularization: Regularization term Index of the ground truth layer Intermediate relaxed matrices Solution:  Block Coordinate Descend (BCD)  Four blocks of variables:
  • 12. Arizona State University- 12 - Updating 𝑴𝒍 Sub-problem: Closed form solution, proved by [Cai et.al. 2009]: Here, and τ = Reference: Jian-Feng Cai, et.al. A singular value thresholding algorithm for matrix completion. SIAM Journal on Optimization, 2010. Singular Value Thresholding
  • 13. Arizona State University- 13 - Updating X Two formulations:  Prior guided ground truth inference (PG-TAC) Prior Statistics  Relaxed simplex ground truth inference (RS-TAC) Slack variable
  • 14. Arizona State University- 14 - Prior guided ground truth inference (PG-TAC)  Solution: Elements of tensor X can be divided into three sets {𝐶1, 𝐶2, 𝐶3} Elements of set 𝑪 𝟏: Elements of set 𝑪 𝟐: Elements of set 𝑪 𝟑: Updating X Tensor T Ground truth layer Elements of Set C3 Elements of Set C1 Elements of Set C2
  • 15. Arizona State University- 15 - Elements of set 𝑪 𝟑: Updating X Prior guided ground truth inference (RS-TAC)  Solution: Elements of tensor X can be divided into three sets {𝐶1, 𝐶2, 𝐶3} Elements of set 𝑪 𝟏: Elements of set 𝑪 𝟐: Tensor T Ground truth layer Elements of Set C3 Elements of Set C1 Elements of Set C2 Different from PG-TAC
  • 16. Arizona State University- 16 - Roadmap  Background  Related work  Crowdsourcing based on TAC  Experimental results  Conclusion
  • 17. Arizona State University- 17 - Experimental Results Synthetic Data Set:  Four configurations:  Notations: • # of Workers: Nw • # of Items: Ni • # of Classes: Nc • Probability of not giving labels q Lower is better Lower is better  Initial configuration: • Nw = 50, Ni = 400 • Nc = 4, q = 0.7
  • 18. Arizona State University- 18 - Experimental Results Real-world Data Set: References: Dengyong Zhou, et.al. Learning from the wisdom of crowds by minimax entropy. NIPS, 2012. Dengyong Zhou, et.al. Regularized minimax conditional entropy for crowdsourcing. CoRR, 2015. Hu Han, et.al. Demographic estimation from face images: Human vs. machine performance. TPAMI, 2015. Rion Snow, et.al. Cheap and fast—but is it good?: Evaluating non-expert annotations for natural language tasks. EMNLP, 2008.
  • 19. Arizona State University- 19 - Experimental Results Real-world Data Set results: References: Qiang Liu, et al. Variational inference for crowdsourcing. NIPS, 2012. Dengyong Zhou, et.al. Learning from the wisdom of crowds by minimax entropy. NIPS, 2012. A. P. Dawid, et al. Maximum likelihood estimation of observer error-rates using the EM algorithm. Applied Statistics, 1979. Jacob Whitehill et al. Whose vote should count more: Optimal integration of labels from labelers of unknown expertise. NIPS , 2009.
  • 20. Arizona State University- 20 - Conclusion  Two novel methods PG-TAC and RS-TAC:  Augment the data tensor with a ground truth layer.  Utilize the structural information of crowd labels.  Infer the true labels of items in binary and multi-class settings.  Experimental results:  Six real data sets.  Outperform state-of-the-art methods.
  • 21. Arizona State University- 21 - Thank you ! & Questions?