SlideShare a Scribd company logo
1 of 32
Open Analytics NYC – 11/08/2012




Building Effective Frameworks
  for Social Media Analysis
Agenda

 •   Social Media: An Intelligence perspective
 •   Common Analytic Pitfalls
 •   An Analytic Framework
 •   Case Study: Brand Management
     –   Problem Definition
     –   Source Selection
     –   Data Capture
     –   Data Reporting
     –   Data Analysis
 • Ways Forward, Future Analysis
 • Questions?
Intelligence

 • Intelligence is information that has been
   transformed to meet an operational need




       Data                Intelligence




        Operational Lens
Intelligence Cycle

 • No matter what methodology you use…

                         Collect

              Distribute            Store

                         Analyze

   intelligence analysis is an iterative process.
Social Media: Intelligence Perspective

 • Social Media Intelligence is a combination of the
   best and worst features of:
    – HUMINT
    – OSINT
    – SIGINT                        HUMINT




                            OSINT            SIGINT
Social Media Analysis Goals

 • Provide value to the organization – turn data into
   intelligence using an “operational lens”
 • Ensure cyclical feedback occurs during collection,
   processing, analysis, and consumption
 • Validate that a particular network is the right source
   of data for the questions you need answered
Common Misconceptions

 • Social media is not a panacea
    – Not everyone uses social media
    – Users of social media use it unevenly
    – User behavior changes based on situations


 • Just because people can talk about anything does
   not mean they talk about everything all the time.
Common Pitfalls

 • Analyzing What Instead of Why:
   The important thing is often not what people are
   saying… but why they are saying it.

 • Using the Wrong Analysis Tools:
   Reporting tools rarely help dig into the why. Many
   common tools, reports, and metrics are actually
   misleading:
    – Word clouds atomize message context
    – Sentiment metrics are often highly inaccurate
    – Information in aggregate hides more than it reveals
Pitfalls: An Example of the Challenge
Pitfalls: An Example of the Challenge
Dangers of Disintegration




                    Source: Matthew Auer, Policy Studies Journal,
                    Volume 39, Issue 4, pages 709–736, Nov 2011
Analytic Framework

 • Data Capture (DC)                                                    Capture
 • Data Reporting (DR)
 • Data Analysis (DA)
   – What to measure           Analyze     Report
   – What the data is saying
   – What should be done based on the data




   Source: Avinash Kaushik, Occam’s Razor Blog http://www.kaushik.net/avinash/web-analytics-consulting-
                                                                          framework-smarter-decisions/
Choosing a Platform

 • Social media, and the ways that it is used, is
   relatively new and evolving rapidly:
   – Static approaches to social media are flawed from
     the outset
   – No one metric or set of metrics will always let you
     know what is happening
 • Platforms need to be open and highly
   adaptable to facilitate data capture, reporting,
   and analysis
Case Study: Brand Management

 • Industry: Gaming
   – Experiencing 10% growth annually
   – Overall revenue expected to exceed $80 billion by
     2014
 • In May, Zenimax Online Studios announced
   Elder Scrolls Online
   – Elder Scrolls V: Skyrim 2nd largest game of 2011
Problem Definition

 • Question: How can brand managers use social
   media to track and understand public
   attitudes toward a product?
 • Challenge: Capture relevant information for
   social media sources.
   – Query too large = false positives
   – Query too small = miss potential information
Twitter

 • Twitter has excellent analytical potential:
   – Enormous volume, 400 million+ tweets per day
   – Large user base, 140 million+ accounts
   – Open API
 • But its not without its limitations:
   – 140 characters
   – Limited historical (lookback) capacity without
     using a 3rd party provider like DataSift or GNIP
Data Capture: Initial Query

 • Twitter search for “Elder Scrolls Online”
   – Simplest possible way to access information
   – RSS feed for 10 days (Jun 27 – July 6 2012)
Data Capture: Entities & Associations

                     Hashtag             TwitterHandle         URL




                                                         Unstructured Keywords
                                           Time / Date Stamp


   Who             What                     When               Where
   TwitterHandle   Hashtags, Keywords,     Time, Date          Geo (if Available)
                   URLs
Data Reporting
Data Reporting
Data Analysis

 • Analysis needs to be rooted in the operational
   need:
    “How can I use social media to track and
    understand public attitudes toward my
    product”

 • Emphasis on hypothesis generation, testing,
   and experimentation
Data Analysis: Hashtags

 • Top hashtags were almost all generic or
   abstract
    – Undermines tracking and understanding
    – Top hashtags tied to franchise, not to the game

 Hashtags
 #ElderScrolls               #concept
 #games                      #nerd
 #online                     #geek
 #MMO                        #gamer
 #skyrim                     #ScreenShot
Data Analysis: Expanding the Query

 • Hash tags from an initial subset of Tweets fed
   back into the initial query

          Initial Query     Expanded Query
             Results            Results




        Twitter Stream
Data Analysis: Sentiment

 • Sentiment analysis on small snippets of text
   like Tweets is generally poor
 • Follow and convert linked URLs into derivative
   sources
 • Larger text sources offer potential value
   with sentiment analysis
   that tweets alone cannot
   offer
Data Analysis: Sentiment

 • Top negative and positive sentiment scores
   can provide a glimpse into aggregate attitudes
 • Provide starting points for additional analysis
Next Steps: Shape the Conversation

 • Create and promote hashtags that help shape
   the conversation and make it easier to collect
   and analyze the Twitter stream
Next Steps: Segment the Data

 • Segment, or cluster, your data by:
   – User name or handle
   – Hashtags
   – Keywords
   – Geographic region
     to explore patterns and trends at the micro level
     versus the entire dataset
Next Steps: Segment the Data
Next Steps: Graph Analysis
Lessons Learned

 • Don’t:
   – Try drinking from a fire hose, sometimes less
     really is more;
   – Use metrics you can’t tie to actions;
   – Use visualizations or reports that strip the data
     from its context.
Lessons Learned

 • Do:
   – Segment data rather than attempting to work in
     the aggregate;
   – Look for the why behind the message;
   – Always return to the source material;
   – Explore alternative explanations;
   – Always consider the ultimate goal.
Thank You!


                Craig Vitter


              www.ikanow.com
             cvitter@ikanow.com


             github.com/ikanow/Infinit.e

More Related Content

What's hot

Text Analytics Today
Text Analytics TodayText Analytics Today
Text Analytics TodaySeth Grimes
 
Big Data Analytics: Facts and Feelings
Big Data Analytics: Facts and FeelingsBig Data Analytics: Facts and Feelings
Big Data Analytics: Facts and FeelingsSeth Grimes
 
Challenges of using Twitter for sentiment analysis
Challenges of using Twitter for sentiment analysisChallenges of using Twitter for sentiment analysis
Challenges of using Twitter for sentiment analysisAna Canhoto
 
Website Analytics and Measurement
Website Analytics and MeasurementWebsite Analytics and Measurement
Website Analytics and MeasurementAdam Lee
 
Dec 21 122112 Marshall Sponder
Dec 21 122112 Marshall SponderDec 21 122112 Marshall Sponder
Dec 21 122112 Marshall SponderMarshall Sponder
 
Sa discover text webinar
Sa discover text webinarSa discover text webinar
Sa discover text webinarQuestionPro
 
7 dee finding the right methodologies marshall sponder - 9-12-12 - submitted
7 dee finding the right methodologies   marshall sponder - 9-12-12 - submitted7 dee finding the right methodologies   marshall sponder - 9-12-12 - submitted
7 dee finding the right methodologies marshall sponder - 9-12-12 - submittedMarshall Sponder
 
Text Analytics Applied (LIDER roadmapping presentation)
Text Analytics Applied (LIDER roadmapping presentation)Text Analytics Applied (LIDER roadmapping presentation)
Text Analytics Applied (LIDER roadmapping presentation)Seth Grimes
 
Digital analytics lecture1
Digital analytics lecture1Digital analytics lecture1
Digital analytics lecture1Joni Salminen
 
MasterThesis_HafsaAsif
MasterThesis_HafsaAsifMasterThesis_HafsaAsif
MasterThesis_HafsaAsifHafsa Asif
 
Social Media AND THE Enterprise Business Intelligence/Analytics Connection
Social Media AND THE Enterprise Business Intelligence/Analytics ConnectionSocial Media AND THE Enterprise Business Intelligence/Analytics Connection
Social Media AND THE Enterprise Business Intelligence/Analytics ConnectionSeth Grimes
 
Organizing 2.0 Social Analytics
Organizing 2.0 Social AnalyticsOrganizing 2.0 Social Analytics
Organizing 2.0 Social AnalyticsBeth Becker
 
Measuring Social Media
Measuring Social MediaMeasuring Social Media
Measuring Social MediaBob Bertsch
 
Emotion Drives Behavior: Building a Data Narrative
Emotion Drives Behavior: Building a Data NarrativeEmotion Drives Behavior: Building a Data Narrative
Emotion Drives Behavior: Building a Data Narrativeevolve24
 
Practical sentiment analysis
Practical sentiment analysisPractical sentiment analysis
Practical sentiment analysisDiana Maynard
 
#ThinkPH Social Media Sentiment Analysis
#ThinkPH Social Media Sentiment Analysis#ThinkPH Social Media Sentiment Analysis
#ThinkPH Social Media Sentiment AnalysisRobin Leonard
 
Project for executive summary v2
Project for executive summary v2Project for executive summary v2
Project for executive summary v200000000A1
 

What's hot (20)

Text Analytics Today
Text Analytics TodayText Analytics Today
Text Analytics Today
 
Big Data Analytics: Facts and Feelings
Big Data Analytics: Facts and FeelingsBig Data Analytics: Facts and Feelings
Big Data Analytics: Facts and Feelings
 
Challenges of using Twitter for sentiment analysis
Challenges of using Twitter for sentiment analysisChallenges of using Twitter for sentiment analysis
Challenges of using Twitter for sentiment analysis
 
Website Analytics and Measurement
Website Analytics and MeasurementWebsite Analytics and Measurement
Website Analytics and Measurement
 
Dec 21 122112 Marshall Sponder
Dec 21 122112 Marshall SponderDec 21 122112 Marshall Sponder
Dec 21 122112 Marshall Sponder
 
Social Media Data Analytics
Social Media Data AnalyticsSocial Media Data Analytics
Social Media Data Analytics
 
Sa discover text webinar
Sa discover text webinarSa discover text webinar
Sa discover text webinar
 
Data Science in Digital Marketing - Forest Cassidy, LeadFerret
Data Science in Digital Marketing - Forest Cassidy, LeadFerretData Science in Digital Marketing - Forest Cassidy, LeadFerret
Data Science in Digital Marketing - Forest Cassidy, LeadFerret
 
7 dee finding the right methodologies marshall sponder - 9-12-12 - submitted
7 dee finding the right methodologies   marshall sponder - 9-12-12 - submitted7 dee finding the right methodologies   marshall sponder - 9-12-12 - submitted
7 dee finding the right methodologies marshall sponder - 9-12-12 - submitted
 
Text Analytics Applied (LIDER roadmapping presentation)
Text Analytics Applied (LIDER roadmapping presentation)Text Analytics Applied (LIDER roadmapping presentation)
Text Analytics Applied (LIDER roadmapping presentation)
 
Digital analytics lecture1
Digital analytics lecture1Digital analytics lecture1
Digital analytics lecture1
 
MasterThesis_HafsaAsif
MasterThesis_HafsaAsifMasterThesis_HafsaAsif
MasterThesis_HafsaAsif
 
Social Media AND THE Enterprise Business Intelligence/Analytics Connection
Social Media AND THE Enterprise Business Intelligence/Analytics ConnectionSocial Media AND THE Enterprise Business Intelligence/Analytics Connection
Social Media AND THE Enterprise Business Intelligence/Analytics Connection
 
Organizing 2.0 Social Analytics
Organizing 2.0 Social AnalyticsOrganizing 2.0 Social Analytics
Organizing 2.0 Social Analytics
 
Measuring Social Media
Measuring Social MediaMeasuring Social Media
Measuring Social Media
 
Emotion Drives Behavior: Building a Data Narrative
Emotion Drives Behavior: Building a Data NarrativeEmotion Drives Behavior: Building a Data Narrative
Emotion Drives Behavior: Building a Data Narrative
 
Neigh october2012
Neigh october2012Neigh october2012
Neigh october2012
 
Practical sentiment analysis
Practical sentiment analysisPractical sentiment analysis
Practical sentiment analysis
 
#ThinkPH Social Media Sentiment Analysis
#ThinkPH Social Media Sentiment Analysis#ThinkPH Social Media Sentiment Analysis
#ThinkPH Social Media Sentiment Analysis
 
Project for executive summary v2
Project for executive summary v2Project for executive summary v2
Project for executive summary v2
 

Similar to Open Analytics: Building Effective Frameworks for Social Media Analysis

Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisOpen Analytics
 
Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media Analysisikanow
 
Open analytics social media framework
Open analytics   social media frameworkOpen analytics   social media framework
Open analytics social media frameworkOpen Analytics
 
Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan Bessie Chu
 
Narrative Mind Week 7 H4D Stanford 2016
Narrative Mind Week 7 H4D Stanford 2016Narrative Mind Week 7 H4D Stanford 2016
Narrative Mind Week 7 H4D Stanford 2016Stanford University
 
A picture is worth a thousand words
A picture is worth a thousand wordsA picture is worth a thousand words
A picture is worth a thousand wordsMasum Billah
 
Social media data analysis
Social media data analysisSocial media data analysis
Social media data analysisShweta Patnaik
 
Advanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseAdvanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseSoftServe
 
Turning Listening into an Organizational Advantage
Turning Listening into an Organizational AdvantageTurning Listening into an Organizational Advantage
Turning Listening into an Organizational AdvantageW2O Group
 
Social Data Intelligence: Webinar with Susan Etlinger
Social Data Intelligence: Webinar with Susan EtlingerSocial Data Intelligence: Webinar with Susan Etlinger
Social Data Intelligence: Webinar with Susan EtlingerSusan Etlinger
 
Univ. of AZ Global Racing Symposium 2015 - Digital Strategies
Univ. of AZ Global Racing Symposium 2015 - Digital StrategiesUniv. of AZ Global Racing Symposium 2015 - Digital Strategies
Univ. of AZ Global Racing Symposium 2015 - Digital Strategiessmfrisby
 
[Slides] Social Data Intelligence Webinar, By Susan Etlinger
[Slides] Social Data Intelligence Webinar, By Susan Etlinger[Slides] Social Data Intelligence Webinar, By Susan Etlinger
[Slides] Social Data Intelligence Webinar, By Susan EtlingerAltimeter, a Prophet Company
 
Lecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and VisualisationLecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and VisualisationMarieke van Erp
 
No BS Monitoring and Measurement
No BS Monitoring and MeasurementNo BS Monitoring and Measurement
No BS Monitoring and MeasurementJason Falls
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptxAkhirulAminulloh2
 
Introductions to Business Analytics
Introductions to Business Analytics Introductions to Business Analytics
Introductions to Business Analytics Venkat .P
 
ScienceOnline impact workshop
ScienceOnline impact workshop ScienceOnline impact workshop
ScienceOnline impact workshop SpotOnLondon
 
Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...
Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...
Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...Hartford Foundation for Public Giving
 

Similar to Open Analytics: Building Effective Frameworks for Social Media Analysis (20)

Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media Analysis
 
Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media Analysis
 
Open analytics social media framework
Open analytics   social media frameworkOpen analytics   social media framework
Open analytics social media framework
 
Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan Approaching Big Data: Lesson Plan
Approaching Big Data: Lesson Plan
 
Narrative Mind Week 7 H4D Stanford 2016
Narrative Mind Week 7 H4D Stanford 2016Narrative Mind Week 7 H4D Stanford 2016
Narrative Mind Week 7 H4D Stanford 2016
 
A picture is worth a thousand words
A picture is worth a thousand wordsA picture is worth a thousand words
A picture is worth a thousand words
 
Social media data analysis
Social media data analysisSocial media data analysis
Social media data analysis
 
Advanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseAdvanced Analytics and Data Science Expertise
Advanced Analytics and Data Science Expertise
 
Week2 chapters1 3
Week2 chapters1 3Week2 chapters1 3
Week2 chapters1 3
 
Turning Listening into an Organizational Advantage
Turning Listening into an Organizational AdvantageTurning Listening into an Organizational Advantage
Turning Listening into an Organizational Advantage
 
Assessing Digital Output in New Ways
Assessing Digital Output in New WaysAssessing Digital Output in New Ways
Assessing Digital Output in New Ways
 
Social Data Intelligence: Webinar with Susan Etlinger
Social Data Intelligence: Webinar with Susan EtlingerSocial Data Intelligence: Webinar with Susan Etlinger
Social Data Intelligence: Webinar with Susan Etlinger
 
Univ. of AZ Global Racing Symposium 2015 - Digital Strategies
Univ. of AZ Global Racing Symposium 2015 - Digital StrategiesUniv. of AZ Global Racing Symposium 2015 - Digital Strategies
Univ. of AZ Global Racing Symposium 2015 - Digital Strategies
 
[Slides] Social Data Intelligence Webinar, By Susan Etlinger
[Slides] Social Data Intelligence Webinar, By Susan Etlinger[Slides] Social Data Intelligence Webinar, By Susan Etlinger
[Slides] Social Data Intelligence Webinar, By Susan Etlinger
 
Lecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and VisualisationLecture 5: Mining, Analysis and Visualisation
Lecture 5: Mining, Analysis and Visualisation
 
No BS Monitoring and Measurement
No BS Monitoring and MeasurementNo BS Monitoring and Measurement
No BS Monitoring and Measurement
 
Introduction Data Science.pptx
Introduction Data Science.pptxIntroduction Data Science.pptx
Introduction Data Science.pptx
 
Introductions to Business Analytics
Introductions to Business Analytics Introductions to Business Analytics
Introductions to Business Analytics
 
ScienceOnline impact workshop
ScienceOnline impact workshop ScienceOnline impact workshop
ScienceOnline impact workshop
 
Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...
Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...
Getting Under the Hood: What Analytics and Metrics Can Show You About Your We...
 

More from ikanow

Aliasing Use Cases - How to Use IKANOW to Crunch Big Data
Aliasing Use Cases - How to Use IKANOW to Crunch Big DataAliasing Use Cases - How to Use IKANOW to Crunch Big Data
Aliasing Use Cases - How to Use IKANOW to Crunch Big Dataikanow
 
Mongo db washington dc 2014
Mongo db washington dc 2014Mongo db washington dc 2014
Mongo db washington dc 2014ikanow
 
Dr. Michael Valivullah, NASS/USDA - Cloud Computing
Dr. Michael Valivullah, NASS/USDA - Cloud ComputingDr. Michael Valivullah, NASS/USDA - Cloud Computing
Dr. Michael Valivullah, NASS/USDA - Cloud Computingikanow
 
Cloud computing with AWS
Cloud computing with AWS Cloud computing with AWS
Cloud computing with AWS ikanow
 
Open Analytics DC June 2012 Presentation
Open Analytics DC June 2012 PresentationOpen Analytics DC June 2012 Presentation
Open Analytics DC June 2012 Presentationikanow
 
MongoDC - Ikanow April 2012 Meetup
MongoDC - Ikanow April 2012 MeetupMongoDC - Ikanow April 2012 Meetup
MongoDC - Ikanow April 2012 Meetupikanow
 
Open Analytics DC April 2012 Meetup
Open Analytics DC April 2012 MeetupOpen Analytics DC April 2012 Meetup
Open Analytics DC April 2012 Meetupikanow
 
Hadoop MapReduce - I'm Sold, Now What?
Hadoop MapReduce - I'm Sold, Now What?Hadoop MapReduce - I'm Sold, Now What?
Hadoop MapReduce - I'm Sold, Now What?ikanow
 
Agile intelligence through Open Analytics
Agile intelligence through Open AnalyticsAgile intelligence through Open Analytics
Agile intelligence through Open Analyticsikanow
 
Social Intelligence: Realizing Business Value in Big Data
Social Intelligence: Realizing Business Value in Big DataSocial Intelligence: Realizing Business Value in Big Data
Social Intelligence: Realizing Business Value in Big Dataikanow
 
How IKANOW uses MongoDB to help organizations solve really big problems
How IKANOW uses MongoDB to help organizations solve really big problemsHow IKANOW uses MongoDB to help organizations solve really big problems
How IKANOW uses MongoDB to help organizations solve really big problemsikanow
 
Value Mining: How Entity Extraction Informs Analysis
Value Mining: How Entity Extraction Informs AnalysisValue Mining: How Entity Extraction Informs Analysis
Value Mining: How Entity Extraction Informs Analysisikanow
 

More from ikanow (12)

Aliasing Use Cases - How to Use IKANOW to Crunch Big Data
Aliasing Use Cases - How to Use IKANOW to Crunch Big DataAliasing Use Cases - How to Use IKANOW to Crunch Big Data
Aliasing Use Cases - How to Use IKANOW to Crunch Big Data
 
Mongo db washington dc 2014
Mongo db washington dc 2014Mongo db washington dc 2014
Mongo db washington dc 2014
 
Dr. Michael Valivullah, NASS/USDA - Cloud Computing
Dr. Michael Valivullah, NASS/USDA - Cloud ComputingDr. Michael Valivullah, NASS/USDA - Cloud Computing
Dr. Michael Valivullah, NASS/USDA - Cloud Computing
 
Cloud computing with AWS
Cloud computing with AWS Cloud computing with AWS
Cloud computing with AWS
 
Open Analytics DC June 2012 Presentation
Open Analytics DC June 2012 PresentationOpen Analytics DC June 2012 Presentation
Open Analytics DC June 2012 Presentation
 
MongoDC - Ikanow April 2012 Meetup
MongoDC - Ikanow April 2012 MeetupMongoDC - Ikanow April 2012 Meetup
MongoDC - Ikanow April 2012 Meetup
 
Open Analytics DC April 2012 Meetup
Open Analytics DC April 2012 MeetupOpen Analytics DC April 2012 Meetup
Open Analytics DC April 2012 Meetup
 
Hadoop MapReduce - I'm Sold, Now What?
Hadoop MapReduce - I'm Sold, Now What?Hadoop MapReduce - I'm Sold, Now What?
Hadoop MapReduce - I'm Sold, Now What?
 
Agile intelligence through Open Analytics
Agile intelligence through Open AnalyticsAgile intelligence through Open Analytics
Agile intelligence through Open Analytics
 
Social Intelligence: Realizing Business Value in Big Data
Social Intelligence: Realizing Business Value in Big DataSocial Intelligence: Realizing Business Value in Big Data
Social Intelligence: Realizing Business Value in Big Data
 
How IKANOW uses MongoDB to help organizations solve really big problems
How IKANOW uses MongoDB to help organizations solve really big problemsHow IKANOW uses MongoDB to help organizations solve really big problems
How IKANOW uses MongoDB to help organizations solve really big problems
 
Value Mining: How Entity Extraction Informs Analysis
Value Mining: How Entity Extraction Informs AnalysisValue Mining: How Entity Extraction Informs Analysis
Value Mining: How Entity Extraction Informs Analysis
 

Recently uploaded

From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 

Recently uploaded (20)

From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 

Open Analytics: Building Effective Frameworks for Social Media Analysis

  • 1. Open Analytics NYC – 11/08/2012 Building Effective Frameworks for Social Media Analysis
  • 2. Agenda • Social Media: An Intelligence perspective • Common Analytic Pitfalls • An Analytic Framework • Case Study: Brand Management – Problem Definition – Source Selection – Data Capture – Data Reporting – Data Analysis • Ways Forward, Future Analysis • Questions?
  • 3. Intelligence • Intelligence is information that has been transformed to meet an operational need Data Intelligence Operational Lens
  • 4. Intelligence Cycle • No matter what methodology you use… Collect Distribute Store Analyze intelligence analysis is an iterative process.
  • 5. Social Media: Intelligence Perspective • Social Media Intelligence is a combination of the best and worst features of: – HUMINT – OSINT – SIGINT HUMINT OSINT SIGINT
  • 6. Social Media Analysis Goals • Provide value to the organization – turn data into intelligence using an “operational lens” • Ensure cyclical feedback occurs during collection, processing, analysis, and consumption • Validate that a particular network is the right source of data for the questions you need answered
  • 7. Common Misconceptions • Social media is not a panacea – Not everyone uses social media – Users of social media use it unevenly – User behavior changes based on situations • Just because people can talk about anything does not mean they talk about everything all the time.
  • 8. Common Pitfalls • Analyzing What Instead of Why: The important thing is often not what people are saying… but why they are saying it. • Using the Wrong Analysis Tools: Reporting tools rarely help dig into the why. Many common tools, reports, and metrics are actually misleading: – Word clouds atomize message context – Sentiment metrics are often highly inaccurate – Information in aggregate hides more than it reveals
  • 9. Pitfalls: An Example of the Challenge
  • 10. Pitfalls: An Example of the Challenge
  • 11. Dangers of Disintegration Source: Matthew Auer, Policy Studies Journal, Volume 39, Issue 4, pages 709–736, Nov 2011
  • 12. Analytic Framework • Data Capture (DC) Capture • Data Reporting (DR) • Data Analysis (DA) – What to measure Analyze Report – What the data is saying – What should be done based on the data Source: Avinash Kaushik, Occam’s Razor Blog http://www.kaushik.net/avinash/web-analytics-consulting- framework-smarter-decisions/
  • 13. Choosing a Platform • Social media, and the ways that it is used, is relatively new and evolving rapidly: – Static approaches to social media are flawed from the outset – No one metric or set of metrics will always let you know what is happening • Platforms need to be open and highly adaptable to facilitate data capture, reporting, and analysis
  • 14. Case Study: Brand Management • Industry: Gaming – Experiencing 10% growth annually – Overall revenue expected to exceed $80 billion by 2014 • In May, Zenimax Online Studios announced Elder Scrolls Online – Elder Scrolls V: Skyrim 2nd largest game of 2011
  • 15. Problem Definition • Question: How can brand managers use social media to track and understand public attitudes toward a product? • Challenge: Capture relevant information for social media sources. – Query too large = false positives – Query too small = miss potential information
  • 16. Twitter • Twitter has excellent analytical potential: – Enormous volume, 400 million+ tweets per day – Large user base, 140 million+ accounts – Open API • But its not without its limitations: – 140 characters – Limited historical (lookback) capacity without using a 3rd party provider like DataSift or GNIP
  • 17. Data Capture: Initial Query • Twitter search for “Elder Scrolls Online” – Simplest possible way to access information – RSS feed for 10 days (Jun 27 – July 6 2012)
  • 18. Data Capture: Entities & Associations Hashtag TwitterHandle URL Unstructured Keywords Time / Date Stamp Who What When Where TwitterHandle Hashtags, Keywords, Time, Date Geo (if Available) URLs
  • 21. Data Analysis • Analysis needs to be rooted in the operational need: “How can I use social media to track and understand public attitudes toward my product” • Emphasis on hypothesis generation, testing, and experimentation
  • 22. Data Analysis: Hashtags • Top hashtags were almost all generic or abstract – Undermines tracking and understanding – Top hashtags tied to franchise, not to the game Hashtags #ElderScrolls #concept #games #nerd #online #geek #MMO #gamer #skyrim #ScreenShot
  • 23. Data Analysis: Expanding the Query • Hash tags from an initial subset of Tweets fed back into the initial query Initial Query Expanded Query Results Results Twitter Stream
  • 24. Data Analysis: Sentiment • Sentiment analysis on small snippets of text like Tweets is generally poor • Follow and convert linked URLs into derivative sources • Larger text sources offer potential value with sentiment analysis that tweets alone cannot offer
  • 25. Data Analysis: Sentiment • Top negative and positive sentiment scores can provide a glimpse into aggregate attitudes • Provide starting points for additional analysis
  • 26. Next Steps: Shape the Conversation • Create and promote hashtags that help shape the conversation and make it easier to collect and analyze the Twitter stream
  • 27. Next Steps: Segment the Data • Segment, or cluster, your data by: – User name or handle – Hashtags – Keywords – Geographic region to explore patterns and trends at the micro level versus the entire dataset
  • 29. Next Steps: Graph Analysis
  • 30. Lessons Learned • Don’t: – Try drinking from a fire hose, sometimes less really is more; – Use metrics you can’t tie to actions; – Use visualizations or reports that strip the data from its context.
  • 31. Lessons Learned • Do: – Segment data rather than attempting to work in the aggregate; – Look for the why behind the message; – Always return to the source material; – Explore alternative explanations; – Always consider the ultimate goal.
  • 32. Thank You! Craig Vitter www.ikanow.com cvitter@ikanow.com github.com/ikanow/Infinit.e

Editor's Notes

  1. Introduction and Topic
  2. Agenda:Social Media: An Intelligence perspectiveCommon Analytic PitfallsAn Analytic FrameworkCase Study: Brand ManagementWays Forward, Future AnalysisQuestions?
  3. Intelligence is information (data) that has been transformed to meet an operational need.There are a lot of ways to move from raw data to usable intelligence.
  4. No matter what methodology you use…intelligence analysis is an iterative processYou Collect the data, Store it, Analyze it, and Distribute the end results to your organization in some usable format.
  5. HUMINT, Human Intelligence: intelligence gathering by means of interpersonal contact. Pros: Can reveal intentions Cons: Can be unreliableOSINT, Open Source Intelligence: intelligence collected from publicly available sources. Pros: Fast and accessible Cons: NoiseSIGINT, Signals Intelligence: intelligence-gathering by interception of signals. Pros: High volume Cons: Noise
  6. My mom does not Tweet or have a FaceBook profile. It only seems like your friends post or tweet every 30 seconds.For example, people use different networks for different reasons so tracking individuals consistently can be difficult
  7. Why is someone tweeting or posting? If some checks in from a store is it really because the store is so incredible that they need to share that information or is because they are trying to form an impression about their lifestyle (i.e. image shaping)?Why is much harder than What.
  8. http://apps.washingtonpost.com/politics/transcripts/2012/presidential/live/737/Washington Post and Votertide collaboration to analyze how viewers reacted to Clinton’s speech at the DNC Convention a few months ago.They captured 496,222 tweets and generated what amounts to a very basic word cloud that really provides limited value from an analysis perspective.
  9. What can you learn from this type of experiment with the wrong tools?A lot of people were tweeting when Clinton was speaking but not many were really tweeting about what Clinton was saying.People like funny tweets
  10. Word clouds can tell you something about the language used but not the meaning behind the language. What you see in the cloud is what but not why.
  11. So how do you avoid some of these pitfalls and get useful intelligence from social media? The answer to that question, or partial answer at least is the focus of the remainder of the presentation.The key ingredients for a framework include Data Capture, Data Reporting and Data Analysis components. All of which are important but the Data Analysis components are the most interesting. :>
  12. There are a lot of platforms that you could use to do social analysis but a few key issues to consider before making a commitment:
  13. This is where I repeat that I shamelessly stole most of this presentation from a coworker Andrew who is a huge gaming fan as well as being a super bright analyst.
  14. Almost all NLP/text extraction/unstructured data analysis tools perform poorly on small blobs of text
  15. Negative: Users weren’t impressed by the game’s teaser and graphics suggesting that the trailer hadn’t been well received.Positive: Other hashtags showed that fans still had positive sentiment towards the Elder Scrolls franchise in general.
  16. Use Graph Analysis to explore the links between entities extracted from your data, for example:Identify Key InfluencersView links between tweets, websites, and blogs