Funded by the Alfred P. Sloan Foundation, the OrgPedia project is developing a free, not-for-profit online directory based on open data about domestic and international, public and private companies.
The ORGpedia beta site make available and downloadable a rich tapestry of information including corporate owners of regulated facilities including nuclear power plants located in the US. ORGpedia uses open government data published by the U.S. EPA, U.S. Nuclear Regulatory Commission, and U.S. Securities and Exchange Commission, as well as, crowd-sourced content from sites including Open Street Maps and ORGpedia itself.
Ms Motilal Padampat Sugar Mills vs. State of Uttar Pradesh & Ors. - A Milesto...
ORGpedia: The Open Organizational Data Project
1. ORGpedia
The Open Organizational
Data Project
Prof. Beth Simone Noveck
Joel Gurin, Executive Director
Website: info@3RoundStones.com
@3RoundStones
Main +1-877-290-2127
Direct +1-571-331-3758
ORGpedia site development team:
Luke Ruth, Application Developer
David Wood, CTO
Bernadette Hyland, CEO
Funded by the
Project: Do Tank
http://dotank.nyls.edu
Friday, June 21, 13
2. Agenda
• Update on ORGpedia site development
• Project status
• Datasets
• Functionality review
• Review Next steps
• NYU stakeholder review
• Engaging contributors
Friday, June 21, 13
3. Problem Statement
• Taxpayers spend billions to have government collect data
• Machine readable content is the new default for
government (OS&T M13-13)
• Yet finding, accessing and combining data is difficult ...
• ORGpedia: the Open Organizational Data Project is
opening new possibilities
• Based entirely on OpenWeb Standards, Open
Source & open government data
• Leverages the crowd for data augmentation
Friday, June 21, 13
4. Project Status Highlights
ORGpedia Phase 1 began
15-April 2013
Completed as of 20 June 2013
• Site live (QA)
• Data platform installed on
Rackspace Cloud
• Use case: Combine open
government data from
regulatory agencies for
nuclear plants
• Data sets: EPA Facilities,
EPA Toxic Releases, NRC
Violations, SEC, Open
Street Maps, DBpedia
• User Contributions
• Wikipedia
• Open Corporates
• Visualizations: maps &
charts
• Support for public &
authenticated users thru
Google, Facebook,Yahoo!
Friday, June 21, 13
5. ORGpedia Datasets
• U.S. EPA
• Facilities
• Toxic Releases
• US Nuclear Regulatory
Commission
• Violations
Friday, June 21, 13
6. ORGpedia is Innovative
• Combines US Government Regulatory data
without the expense, time and high failure rate of
traditional approaches
• Leverages crowdsourcing to fix dirty data and
builds on expert and/or local knowledge
• Open vs. proprietary approach
• Creates opportunity for new businesses &
startups
Friday, June 21, 13
7. Technical Innovation
• Linked Data --> Easy combination of datasets
• Leverages an innovative Open Source data
platform (Callimachus)
• Crowdsourced contributions
• Contributions overlay (do not overwrite)
authoritative data
• Web-scale
Friday, June 21, 13
9. “Linked Data allows for
cooperation without coordination”
- DavidWood
Friday, June 21, 13
10. OpenStack Cloud Services Amazon Web Services
Traditional networks
Persistent URL (PURL)
Services
Linked Data
Services
Mobile Web Print
Friday, June 21, 13
11. ORGpedia is Interactive
• The public can view, comment, discuss
• Contributors can:
• overlay controlled fields
• Associate facilities with Linked Open
Data and other Web content (e.g. stock
quotes, images)
• Template-driven approach make extending
the site easy
Friday, June 21, 13
12. ORGpedia
User Contributions
Add/modify
• EPA Facility name
• Wikipedia abstract
• Corporate Owner(s)
• Wikipedia abstract
• Open Corporates ID
• Stock ticker
• SEC ID
• Related images
Friday, June 21, 13
30. ORGpedia Site Summary
• Addresses a compelling problem
• Live site
• Hosted on the cloud
• Driven by Open data
• Government
• Linked Open Data
• Crowdsourced
• Open Source Data platform
• Template driven approach
• Drives hundreds of millions of web pages
Callimachus
Friday, June 21, 13