SlideShare a Scribd company logo
1 of 24
Download to read offline
Open Source Community Metrics
Tips and Techniques for Measuring Participation


              Open Source Bridge
                  June 2011

                 Dawn M. Foster
          MeeGo Community Manager at Intel

                  @geekygirldawn
               dawn@fastwonder.com

                 fastwonderblog.com
                     meego.com
Stuff I'll Talk About

  ●   What, why and example metrics from MeeGo
  ●   Coming up with the right metrics
  ●   Tips and techniques for collecting metrics
  ●   Sharing metrics and highlighting community members




                                    Photo: http://www.flickr.com/photos/falcifer/3136673599

                                2
Community Definition

 ●   Community includes all of the people who work on the project
      ●   Product contributors: kernel / distribution developers, release
          managers, quality assurance, localization, etc.
      ●   App developers: writing applications
      ●   Users: people who run your software and provide feedback
      ●   Vendors: companies creating products based on your project
      ●   Other contributors: promotion, moderation, documentation and more


     Some people contribute as part of their employment at companies,
     while others contribute free time. The community includes all of the
                  people who are working on your project.




                                        3
Metrics are Useful for Open Source Projects

  ●   Measure progress in your community over time
  ●   Who contributes
  ●   Where are people contributing
  ●   Spot trends
  ●   Gauge interest
  ●   Learn more about key contributors
  ●   Recognize contributions




                                4
Example: Components of an Open Source Community




                        5
Example: April MeeGo Community Metrics Summary
• 3,534,575 unique people have visited MeeGo.com (cumulative total)
 •   295,992 unique people visited this month (333,293 last month).

• 22,914 people are members of MeeGo.com (was 21,823 last month)
 •   Dev ML subscribers = 4983; Community = 3929; iL10N = 2871; SDK = 3313

• Mailing Lists: 4891 posts this month; 220 people posted 2+ msgs
• Forums: 862 posts. 123 people posted 2+ messages
• New Bugs Created: 1757; Bugs Resolved: 2988
• 1.1 Downloads: 39,044 Netbook, 4171 Tablet, 3346 IVI, 2699 N900
• Active Users: Estimated at 800 – 1000 people.
 •   Mailing Lists: 343 people with unique email addresses posted (367 last month)
 •   Forums: 229 people posted at least one item (281 last month)
 •   Bugzilla: 716 people performed some action (552 last month)
 •   IRC: 410-500 people logged into #meego simultaneously most days



                  http://wiki.meego.com/Metrics
                                              6
What are the Right Metrics for YOUR Project

 ●   Goals
     ●   What are your overall goals for the project?
     ●   How can you measure progress toward those goals?
     ●   What is important to you and your progress?
 ●   Trends
     ●   What should you measure to recognize trends?
     ●   How do you recognize when something is going wrong?
     ●   Do you notice big improvements?




                  Note: I measure way too much
                                  7          http://www.flickr.com/photos/bandfan/5548675317/
Mailing Lists: mlstats

  Mailing List Stats is a command line tool used to analyze
   mailing list archives. It downloads the archives, places
 them in a directory and stores all the information contained
           in each mailing list post into a database




               http://libresoft.es/tools/mlstats


                              8
Mailing Lists: mlstats
●   Grab data from your mailing & store in db (repeat per ML)
       –   /mlstats --db-user=user --db-password=pw
           http://lists.meego.com/pipermail/meego-community/
●   Top Content Query
       –   select subject,monthname(first_date) as m,count(*) as c from
           messages where month(first_date)=$MONTH and
           year(first_date)=$YEAR group by subject, month(first_date)
           order by m, c;
●   Top Poster Query
       –   select p.email_address,year(m.first_date) as y,
           monthname(m.first_date),count(*) as c from messages as
           m,messages_people as p where
           m.message_id=p.message_ID and
           month(first_date)=$MONTH and year(first_date)=$YEAR
           group by p.email_address, month(m.first_date) order by y,
           month(m.first_date), c;
                                    9
Mailing Lists: Top Content Result (graphed)




         What are people talking about?
                          10
Mailing Lists: Top Posters Result
           Rank   Email Address                   Posts
           1      arjan@linux.intel.com           48
           2      peter.j.zhu@intel.com           47
           3      shane.bryan@linux.intel.com     41
           4      prajwal.karur.mohan@intel.com   37
           5      carsten@maemo.org               37
           6      yi.y.yang@intel.com             33
           7      marko.saukko@cybercom.com       29
           8      nashif@linux.intel.com          28
           9      fathi.boudra@nokia.com          28
           10     rusty.lynch@intel.com           27
           11     alan@linux.intel.com            26
           12     nielsmayer@gmail.com            26
           13     joel.clark@intel.com            24
           14     kristen@linux.intel.com         23
           15     rolla.n.selbak@intel.com        23
           16     shaofeng.tang@intel.com         23
           17     stephen.gadsby@gmail.com        22
           18     casey.schaufler@nokia.com       21
           19     gabrbedd@gmail.com              21
           20     dawn.m.foster@intel.com         21

      Who contributes to ML discussions?
                                  11
IRC: irssistats

  Generates IRC stats for active people, by hour of the day,
        by day, most used words, quotes and more.



             http://royale.zerezo.com/irssistats/




                             12
IRC: irssistats




                  13
IRC: irssistats




                  14
IRC: irssistats




                  15
Bugs

 ●   New bugs vs. resolved bugs
     ●   Can't just look at monthly trends
     ●   Need to take release cycle into account
     ●   Before release: more resolved bugs
     ●   After release: more new bugs
 ●   Participants
     ●   People who file new bugs
     ●   Participate in bugs (comment, etc.)
     ●   Careful with people who resolve bugs (usually QA)




                                       Image: http://www.thegeekstuff.com/2010/05/install-bugzilla-on-linux/

                                  16
New Bugs Filed
        Rank   Reporter                           New Bugs
        1      yukarix.yamashita@intel.com        69
        2      petrax.m.klein@intel.com           62
        3      caseyx.m.harper@intel.com          57
        4      gang.ye@intel.com                  51
        5      alexandru.c.georgescu@intel.com    45
        6      jenny.lin@intel.com                40
        7      leileix.l.yang@intel.com           39
        8      danielax.l.morescalchi@intel.com   38
        9      tomrantom@gmail.com                38
        10     arnabx.banerjee@intel.com          35
        11     pablox.barbecho@intel.com          35
        12     neo.wang@intel.com                 34
        13     mariex.r.goodart@intel.com         30
        14     marko.saukko@gmail.com             28
        15     davidx.g.brown@intel.com           27
        16     ext-gabor.2.juhasz@nokia.com       26
        17     michael.kuperstein@intel.com       25
        18     xux.tian@intel.com                 25
        19     qiongfangx.xiong@intel.com         24
        20     songx.wang@intel.com               23

                                 17
Media Wiki




             18
Media Wiki

 ●   Get Statistics
     ●   http://wiki.meego.com/Special:Statistics
     ●   wget "http://wiki.meego.com/api.php?
         action=query&meta=siteinfo&siprop=statistics&format=ya
         mlfm"




                                 19
Website: Google Analytics




              Nokia Announcement
Roadmaps           Mobile World Congress




Source: Google Analytics
(excludes wiki prior to Dec 21)



                                           20
Automate
 ●   My less than elegant method
     ●   Giant bash script
     ●   Uses wget, awk, mysql queries, etc.
     ●   Dumps a bunch of csv files on my hard drive
 ●   A better dashboard approach (WIP)
     ●   Open source metrics dashboard
     ●   Uses Pentaho for reporting, runs regularly and produces a
         dashboard anyone can view at any time
     ●   Will be finished in the next couple of months
     ●   http://wiki.meego.com/Metrics/Dashboard




                                  21
Now What?

 ●   Report Regularly
     ●   Monthly – may be too often
     ●   Quartely? Yearly?
 ●   Share
     ●   Share the reports with the
         community
     ●   http://wiki.meego.com/Metrics
 ●   Recognize
     ●   Recognize your top contributors


                                           http://www.flickr.com/photos/play4smee/2439494411/




                                 22
Dawn Foster
MeeGo Community Manager for Intel

          @geekygirldawn

     dawn.m.foster@intel.com




        http://www.flickr.com/photos/tlk/5630885373/
Credits
   Thank you to the many people who have contributed to
   the metrics
   ●   Dave Neary for many helpful tips & for providing a lot of
       help with mailing list stats (mlstats)
   ●   Reggie Suplido for automating forum stats:
       http://forum.meego.com/stats/
   ●   Carsten Munk for the IRC stats
   ●   Stephen Gadsby for the bug jars:
       http://www.octofish.net/meegobugjar/
   ●   Mike Shaver for a variety of help
   ●   Arjan Van De Ven for some Perl magic
   ●   Adam Gretzinger for providing download data


                                24

More Related Content

Similar to Measuring Open Source Community Participation

Crunching the numbers: Open Source Community Metrics
Crunching the numbers: Open Source Community MetricsCrunching the numbers: Open Source Community Metrics
Crunching the numbers: Open Source Community MetricsDawn Foster
 
Open Source Community Metrics LibreOffice Conference
Open Source Community Metrics LibreOffice ConferenceOpen Source Community Metrics LibreOffice Conference
Open Source Community Metrics LibreOffice ConferenceDawn Foster
 
Open Source Community Metrics for FOSDEM
Open Source Community Metrics for FOSDEMOpen Source Community Metrics for FOSDEM
Open Source Community Metrics for FOSDEMDawn Foster
 
Open Source Community Metrics: LinuxCon Barcelona
Open Source Community Metrics: LinuxCon BarcelonaOpen Source Community Metrics: LinuxCon Barcelona
Open Source Community Metrics: LinuxCon BarcelonaDawn Foster
 
What's new in the latest source{d} releases!
What's new in the latest source{d} releases!What's new in the latest source{d} releases!
What's new in the latest source{d} releases!source{d}
 
Continuum Analytics and Python
Continuum Analytics and PythonContinuum Analytics and Python
Continuum Analytics and PythonTravis Oliphant
 
Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...Jody Garnett
 
UXDX Berlin - Test & Deploy, by Quentin Berder, President, WiredCraft
UXDX Berlin - Test & Deploy, by Quentin Berder, President, WiredCraftUXDX Berlin - Test & Deploy, by Quentin Berder, President, WiredCraft
UXDX Berlin - Test & Deploy, by Quentin Berder, President, WiredCraftUXDXConf
 
Electric Capital Developer Report 2022
Electric Capital Developer Report 2022Electric Capital Developer Report 2022
Electric Capital Developer Report 2022MariaShen2
 
Open Source Lessons from the TODO Group
Open Source Lessons from the TODO GroupOpen Source Lessons from the TODO Group
Open Source Lessons from the TODO GroupChris Aniszczyk
 
Tips for Nonprofits- Manage Content & Collaborate in the Cloud.
Tips for Nonprofits-  Manage Content & Collaborate in the Cloud.Tips for Nonprofits-  Manage Content & Collaborate in the Cloud.
Tips for Nonprofits- Manage Content & Collaborate in the Cloud.Box
 
How to Run a Successful Open Source Java EE Project
How to Run a Successful Open Source Java EE ProjectHow to Run a Successful Open Source Java EE Project
How to Run a Successful Open Source Java EE Projectphilipdurbin
 
Developing Effective Software Productively
Developing Effective Software ProductivelyDeveloping Effective Software Productively
Developing Effective Software ProductivelyGail Murphy
 
Marvin Platform – Potencializando equipes de Machine Learning
Marvin Platform – Potencializando equipes de Machine LearningMarvin Platform – Potencializando equipes de Machine Learning
Marvin Platform – Potencializando equipes de Machine LearningDaniel Takabayashi, MSc
 
Streaming Trend Discovery: Real-Time Discovery in a Sea of Events with Scott ...
Streaming Trend Discovery: Real-Time Discovery in a Sea of Events with Scott ...Streaming Trend Discovery: Real-Time Discovery in a Sea of Events with Scott ...
Streaming Trend Discovery: Real-Time Discovery in a Sea of Events with Scott ...Databricks
 
Oscon 2016: open source lessons from the todo group
Oscon 2016: open source lessons from the todo groupOscon 2016: open source lessons from the todo group
Oscon 2016: open source lessons from the todo groupBen VanEvery
 
Electric Capital Crypto Dev Report · 2022
Electric Capital Crypto Dev Report · 2022Electric Capital Crypto Dev Report · 2022
Electric Capital Crypto Dev Report · 2022Maria Xinhe Shen
 
[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)Kunwoo Park
 
Introduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and PythonIntroduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and PythonJo-fai Chow
 

Similar to Measuring Open Source Community Participation (20)

Crunching the numbers: Open Source Community Metrics
Crunching the numbers: Open Source Community MetricsCrunching the numbers: Open Source Community Metrics
Crunching the numbers: Open Source Community Metrics
 
Open Source Community Metrics LibreOffice Conference
Open Source Community Metrics LibreOffice ConferenceOpen Source Community Metrics LibreOffice Conference
Open Source Community Metrics LibreOffice Conference
 
Open Source Community Metrics for FOSDEM
Open Source Community Metrics for FOSDEMOpen Source Community Metrics for FOSDEM
Open Source Community Metrics for FOSDEM
 
Open Source Community Metrics: LinuxCon Barcelona
Open Source Community Metrics: LinuxCon BarcelonaOpen Source Community Metrics: LinuxCon Barcelona
Open Source Community Metrics: LinuxCon Barcelona
 
What's new in the latest source{d} releases!
What's new in the latest source{d} releases!What's new in the latest source{d} releases!
What's new in the latest source{d} releases!
 
Continuum Analytics and Python
Continuum Analytics and PythonContinuum Analytics and Python
Continuum Analytics and Python
 
Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...Everyone wants (someone else) to do it: writing documentation for open source...
Everyone wants (someone else) to do it: writing documentation for open source...
 
UXDX Berlin - Test & Deploy, by Quentin Berder, President, WiredCraft
UXDX Berlin - Test & Deploy, by Quentin Berder, President, WiredCraftUXDX Berlin - Test & Deploy, by Quentin Berder, President, WiredCraft
UXDX Berlin - Test & Deploy, by Quentin Berder, President, WiredCraft
 
Electric Capital Developer Report 2022
Electric Capital Developer Report 2022Electric Capital Developer Report 2022
Electric Capital Developer Report 2022
 
Open Source Lessons from the TODO Group
Open Source Lessons from the TODO GroupOpen Source Lessons from the TODO Group
Open Source Lessons from the TODO Group
 
Crypto Dev Report · 2022
Crypto Dev Report · 2022Crypto Dev Report · 2022
Crypto Dev Report · 2022
 
Tips for Nonprofits- Manage Content & Collaborate in the Cloud.
Tips for Nonprofits-  Manage Content & Collaborate in the Cloud.Tips for Nonprofits-  Manage Content & Collaborate in the Cloud.
Tips for Nonprofits- Manage Content & Collaborate in the Cloud.
 
How to Run a Successful Open Source Java EE Project
How to Run a Successful Open Source Java EE ProjectHow to Run a Successful Open Source Java EE Project
How to Run a Successful Open Source Java EE Project
 
Developing Effective Software Productively
Developing Effective Software ProductivelyDeveloping Effective Software Productively
Developing Effective Software Productively
 
Marvin Platform – Potencializando equipes de Machine Learning
Marvin Platform – Potencializando equipes de Machine LearningMarvin Platform – Potencializando equipes de Machine Learning
Marvin Platform – Potencializando equipes de Machine Learning
 
Streaming Trend Discovery: Real-Time Discovery in a Sea of Events with Scott ...
Streaming Trend Discovery: Real-Time Discovery in a Sea of Events with Scott ...Streaming Trend Discovery: Real-Time Discovery in a Sea of Events with Scott ...
Streaming Trend Discovery: Real-Time Discovery in a Sea of Events with Scott ...
 
Oscon 2016: open source lessons from the todo group
Oscon 2016: open source lessons from the todo groupOscon 2016: open source lessons from the todo group
Oscon 2016: open source lessons from the todo group
 
Electric Capital Crypto Dev Report · 2022
Electric Capital Crypto Dev Report · 2022Electric Capital Crypto Dev Report · 2022
Electric Capital Crypto Dev Report · 2022
 
[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)[CS570] Machine Learning Team Project (I know what items really are)
[CS570] Machine Learning Team Project (I know what items really are)
 
Introduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and PythonIntroduction to Machine Learning with H2O and Python
Introduction to Machine Learning with H2O and Python
 

More from Dawn Foster

CHAOSS Metrics Overview and Examples
CHAOSS Metrics Overview and ExamplesCHAOSS Metrics Overview and Examples
CHAOSS Metrics Overview and ExamplesDawn Foster
 
Be a Good Corporate Citizen in Kubernetes
Be a Good Corporate Citizen in KubernetesBe a Good Corporate Citizen in Kubernetes
Be a Good Corporate Citizen in KubernetesDawn Foster
 
Overcoming Imposter Syndrome to Become a Conference Speaker!
Overcoming Imposter Syndrome to Become a Conference Speaker!Overcoming Imposter Syndrome to Become a Conference Speaker!
Overcoming Imposter Syndrome to Become a Conference Speaker!Dawn Foster
 
How to Be a Good Corporate Citizen in Open Source
How to Be a Good Corporate Citizen in Open SourceHow to Be a Good Corporate Citizen in Open Source
How to Be a Good Corporate Citizen in Open SourceDawn Foster
 
Open Source Collaboration and Companies: Finding the Right Balance
Open Source Collaboration and Companies: Finding the Right BalanceOpen Source Collaboration and Companies: Finding the Right Balance
Open Source Collaboration and Companies: Finding the Right BalanceDawn Foster
 
Navigating Open Source Risk
Navigating Open Source RiskNavigating Open Source Risk
Navigating Open Source RiskDawn Foster
 
Measuring Project Health at VMware
Measuring Project Health at VMwareMeasuring Project Health at VMware
Measuring Project Health at VMwareDawn Foster
 
Navigating Open Source Risk
Navigating Open Source RiskNavigating Open Source Risk
Navigating Open Source RiskDawn Foster
 
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationCollaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationDawn Foster
 
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationCollaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationDawn Foster
 
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationCollaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationDawn Foster
 
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationCollaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationDawn Foster
 
Is this Open Source Project Healthy or Lifeless?
Is this Open Source Project Healthy or Lifeless?Is this Open Source Project Healthy or Lifeless?
Is this Open Source Project Healthy or Lifeless?Dawn Foster
 
Collaboration in Linux Kernel Mailing Lists
Collaboration in Linux Kernel Mailing Lists Collaboration in Linux Kernel Mailing Lists
Collaboration in Linux Kernel Mailing Lists Dawn Foster
 
Be a Good Corporate Citizen in Kubernetes
Be a Good Corporate Citizen in KubernetesBe a Good Corporate Citizen in Kubernetes
Be a Good Corporate Citizen in KubernetesDawn Foster
 
Being a Good Corporate Citizen in Open Source
Being a Good Corporate Citizen in Open SourceBeing a Good Corporate Citizen in Open Source
Being a Good Corporate Citizen in Open SourceDawn Foster
 
Building Community for your Company’s OSS Projects
Building Community for your Company’s OSS ProjectsBuilding Community for your Company’s OSS Projects
Building Community for your Company’s OSS ProjectsDawn Foster
 
Building Community for your Company’s OSS Project
Building Community for your Company’s OSS ProjectBuilding Community for your Company’s OSS Project
Building Community for your Company’s OSS ProjectDawn Foster
 
How to be a terrible hiring manager
How to be a terrible hiring managerHow to be a terrible hiring manager
How to be a terrible hiring managerDawn Foster
 
A week in the Life of Kubernetes
A week in the Life of KubernetesA week in the Life of Kubernetes
A week in the Life of KubernetesDawn Foster
 

More from Dawn Foster (20)

CHAOSS Metrics Overview and Examples
CHAOSS Metrics Overview and ExamplesCHAOSS Metrics Overview and Examples
CHAOSS Metrics Overview and Examples
 
Be a Good Corporate Citizen in Kubernetes
Be a Good Corporate Citizen in KubernetesBe a Good Corporate Citizen in Kubernetes
Be a Good Corporate Citizen in Kubernetes
 
Overcoming Imposter Syndrome to Become a Conference Speaker!
Overcoming Imposter Syndrome to Become a Conference Speaker!Overcoming Imposter Syndrome to Become a Conference Speaker!
Overcoming Imposter Syndrome to Become a Conference Speaker!
 
How to Be a Good Corporate Citizen in Open Source
How to Be a Good Corporate Citizen in Open SourceHow to Be a Good Corporate Citizen in Open Source
How to Be a Good Corporate Citizen in Open Source
 
Open Source Collaboration and Companies: Finding the Right Balance
Open Source Collaboration and Companies: Finding the Right BalanceOpen Source Collaboration and Companies: Finding the Right Balance
Open Source Collaboration and Companies: Finding the Right Balance
 
Navigating Open Source Risk
Navigating Open Source RiskNavigating Open Source Risk
Navigating Open Source Risk
 
Measuring Project Health at VMware
Measuring Project Health at VMwareMeasuring Project Health at VMware
Measuring Project Health at VMware
 
Navigating Open Source Risk
Navigating Open Source RiskNavigating Open Source Risk
Navigating Open Source Risk
 
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationCollaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company Affiliation
 
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationCollaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company Affiliation
 
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationCollaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company Affiliation
 
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company AffiliationCollaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company Affiliation
 
Is this Open Source Project Healthy or Lifeless?
Is this Open Source Project Healthy or Lifeless?Is this Open Source Project Healthy or Lifeless?
Is this Open Source Project Healthy or Lifeless?
 
Collaboration in Linux Kernel Mailing Lists
Collaboration in Linux Kernel Mailing Lists Collaboration in Linux Kernel Mailing Lists
Collaboration in Linux Kernel Mailing Lists
 
Be a Good Corporate Citizen in Kubernetes
Be a Good Corporate Citizen in KubernetesBe a Good Corporate Citizen in Kubernetes
Be a Good Corporate Citizen in Kubernetes
 
Being a Good Corporate Citizen in Open Source
Being a Good Corporate Citizen in Open SourceBeing a Good Corporate Citizen in Open Source
Being a Good Corporate Citizen in Open Source
 
Building Community for your Company’s OSS Projects
Building Community for your Company’s OSS ProjectsBuilding Community for your Company’s OSS Projects
Building Community for your Company’s OSS Projects
 
Building Community for your Company’s OSS Project
Building Community for your Company’s OSS ProjectBuilding Community for your Company’s OSS Project
Building Community for your Company’s OSS Project
 
How to be a terrible hiring manager
How to be a terrible hiring managerHow to be a terrible hiring manager
How to be a terrible hiring manager
 
A week in the Life of Kubernetes
A week in the Life of KubernetesA week in the Life of Kubernetes
A week in the Life of Kubernetes
 

Recently uploaded

Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 

Recently uploaded (20)

Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 

Measuring Open Source Community Participation

  • 1. Open Source Community Metrics Tips and Techniques for Measuring Participation Open Source Bridge June 2011 Dawn M. Foster MeeGo Community Manager at Intel @geekygirldawn dawn@fastwonder.com fastwonderblog.com meego.com
  • 2. Stuff I'll Talk About ● What, why and example metrics from MeeGo ● Coming up with the right metrics ● Tips and techniques for collecting metrics ● Sharing metrics and highlighting community members Photo: http://www.flickr.com/photos/falcifer/3136673599 2
  • 3. Community Definition ● Community includes all of the people who work on the project ● Product contributors: kernel / distribution developers, release managers, quality assurance, localization, etc. ● App developers: writing applications ● Users: people who run your software and provide feedback ● Vendors: companies creating products based on your project ● Other contributors: promotion, moderation, documentation and more Some people contribute as part of their employment at companies, while others contribute free time. The community includes all of the people who are working on your project. 3
  • 4. Metrics are Useful for Open Source Projects ● Measure progress in your community over time ● Who contributes ● Where are people contributing ● Spot trends ● Gauge interest ● Learn more about key contributors ● Recognize contributions 4
  • 5. Example: Components of an Open Source Community 5
  • 6. Example: April MeeGo Community Metrics Summary • 3,534,575 unique people have visited MeeGo.com (cumulative total) • 295,992 unique people visited this month (333,293 last month). • 22,914 people are members of MeeGo.com (was 21,823 last month) • Dev ML subscribers = 4983; Community = 3929; iL10N = 2871; SDK = 3313 • Mailing Lists: 4891 posts this month; 220 people posted 2+ msgs • Forums: 862 posts. 123 people posted 2+ messages • New Bugs Created: 1757; Bugs Resolved: 2988 • 1.1 Downloads: 39,044 Netbook, 4171 Tablet, 3346 IVI, 2699 N900 • Active Users: Estimated at 800 – 1000 people. • Mailing Lists: 343 people with unique email addresses posted (367 last month) • Forums: 229 people posted at least one item (281 last month) • Bugzilla: 716 people performed some action (552 last month) • IRC: 410-500 people logged into #meego simultaneously most days http://wiki.meego.com/Metrics 6
  • 7. What are the Right Metrics for YOUR Project ● Goals ● What are your overall goals for the project? ● How can you measure progress toward those goals? ● What is important to you and your progress? ● Trends ● What should you measure to recognize trends? ● How do you recognize when something is going wrong? ● Do you notice big improvements? Note: I measure way too much 7 http://www.flickr.com/photos/bandfan/5548675317/
  • 8. Mailing Lists: mlstats Mailing List Stats is a command line tool used to analyze mailing list archives. It downloads the archives, places them in a directory and stores all the information contained in each mailing list post into a database http://libresoft.es/tools/mlstats 8
  • 9. Mailing Lists: mlstats ● Grab data from your mailing & store in db (repeat per ML) – /mlstats --db-user=user --db-password=pw http://lists.meego.com/pipermail/meego-community/ ● Top Content Query – select subject,monthname(first_date) as m,count(*) as c from messages where month(first_date)=$MONTH and year(first_date)=$YEAR group by subject, month(first_date) order by m, c; ● Top Poster Query – select p.email_address,year(m.first_date) as y, monthname(m.first_date),count(*) as c from messages as m,messages_people as p where m.message_id=p.message_ID and month(first_date)=$MONTH and year(first_date)=$YEAR group by p.email_address, month(m.first_date) order by y, month(m.first_date), c; 9
  • 10. Mailing Lists: Top Content Result (graphed) What are people talking about? 10
  • 11. Mailing Lists: Top Posters Result Rank Email Address Posts 1 arjan@linux.intel.com 48 2 peter.j.zhu@intel.com 47 3 shane.bryan@linux.intel.com 41 4 prajwal.karur.mohan@intel.com 37 5 carsten@maemo.org 37 6 yi.y.yang@intel.com 33 7 marko.saukko@cybercom.com 29 8 nashif@linux.intel.com 28 9 fathi.boudra@nokia.com 28 10 rusty.lynch@intel.com 27 11 alan@linux.intel.com 26 12 nielsmayer@gmail.com 26 13 joel.clark@intel.com 24 14 kristen@linux.intel.com 23 15 rolla.n.selbak@intel.com 23 16 shaofeng.tang@intel.com 23 17 stephen.gadsby@gmail.com 22 18 casey.schaufler@nokia.com 21 19 gabrbedd@gmail.com 21 20 dawn.m.foster@intel.com 21 Who contributes to ML discussions? 11
  • 12. IRC: irssistats Generates IRC stats for active people, by hour of the day, by day, most used words, quotes and more. http://royale.zerezo.com/irssistats/ 12
  • 16. Bugs ● New bugs vs. resolved bugs ● Can't just look at monthly trends ● Need to take release cycle into account ● Before release: more resolved bugs ● After release: more new bugs ● Participants ● People who file new bugs ● Participate in bugs (comment, etc.) ● Careful with people who resolve bugs (usually QA) Image: http://www.thegeekstuff.com/2010/05/install-bugzilla-on-linux/ 16
  • 17. New Bugs Filed Rank Reporter New Bugs 1 yukarix.yamashita@intel.com 69 2 petrax.m.klein@intel.com 62 3 caseyx.m.harper@intel.com 57 4 gang.ye@intel.com 51 5 alexandru.c.georgescu@intel.com 45 6 jenny.lin@intel.com 40 7 leileix.l.yang@intel.com 39 8 danielax.l.morescalchi@intel.com 38 9 tomrantom@gmail.com 38 10 arnabx.banerjee@intel.com 35 11 pablox.barbecho@intel.com 35 12 neo.wang@intel.com 34 13 mariex.r.goodart@intel.com 30 14 marko.saukko@gmail.com 28 15 davidx.g.brown@intel.com 27 16 ext-gabor.2.juhasz@nokia.com 26 17 michael.kuperstein@intel.com 25 18 xux.tian@intel.com 25 19 qiongfangx.xiong@intel.com 24 20 songx.wang@intel.com 23 17
  • 19. Media Wiki ● Get Statistics ● http://wiki.meego.com/Special:Statistics ● wget "http://wiki.meego.com/api.php? action=query&meta=siteinfo&siprop=statistics&format=ya mlfm" 19
  • 20. Website: Google Analytics Nokia Announcement Roadmaps Mobile World Congress Source: Google Analytics (excludes wiki prior to Dec 21) 20
  • 21. Automate ● My less than elegant method ● Giant bash script ● Uses wget, awk, mysql queries, etc. ● Dumps a bunch of csv files on my hard drive ● A better dashboard approach (WIP) ● Open source metrics dashboard ● Uses Pentaho for reporting, runs regularly and produces a dashboard anyone can view at any time ● Will be finished in the next couple of months ● http://wiki.meego.com/Metrics/Dashboard 21
  • 22. Now What? ● Report Regularly ● Monthly – may be too often ● Quartely? Yearly? ● Share ● Share the reports with the community ● http://wiki.meego.com/Metrics ● Recognize ● Recognize your top contributors http://www.flickr.com/photos/play4smee/2439494411/ 22
  • 23. Dawn Foster MeeGo Community Manager for Intel @geekygirldawn dawn.m.foster@intel.com http://www.flickr.com/photos/tlk/5630885373/
  • 24. Credits Thank you to the many people who have contributed to the metrics ● Dave Neary for many helpful tips & for providing a lot of help with mailing list stats (mlstats) ● Reggie Suplido for automating forum stats: http://forum.meego.com/stats/ ● Carsten Munk for the IRC stats ● Stephen Gadsby for the bug jars: http://www.octofish.net/meegobugjar/ ● Mike Shaver for a variety of help ● Arjan Van De Ven for some Perl magic ● Adam Gretzinger for providing download data 24