SlideShare a Scribd company logo
1 of 29
1
More Meaning. Better Results.
1
Building the Inform Semantic Publishing Ecosystem:
from Author to Audience
Marc Hadfield
VP, Research & Development
marc@inform.com
2
Marc Hadfield
• Semantic Technology, Computer Science
• Inform Technologies (Head of R&D)
‣ Semantic Technologies applied to Content Analysis & Distribution
• Alitora Systems (Co-Founder / CTO)
‣ Life Science Semantic Technology, Research, Big Data Analytics, Semantic HPC
‣ Life Science Natural Language Processing
• Columbia Genome Center
‣ NLP applied to Life Science Research Articles
• LCconnect (CTO)
‣ Letter-of-Credit Exchange
2
3
Semantics in Publishing…
3
• Ongoing Theme at ISWC 2010…
‣ NY Times
‣ Facebook (OpenGraph)
‣ Elsevier
‣ BBC
4
What is Inform?
4
• Inform is a content enrichment solution designed to increase consumer
engagement, page views and revenue.
• We provide a hosted Semantic Web Service for content publishers that:
1. Reads your article before you publish it
2. Turns main topics and entities (people, places, companies, organizations) into links
3. Provides feeds of related web content when you publish it
• New Direction: Optimizing Content Distribution via Direct Channels
• Web users moving away from destination web sites, but still want the destination web
site content.
• Companies utilizing Inform include:
Connecting your content
55
Audio, Video & Blogs
from the Web
Articles from
the Web
Content from Inform
Your Affiliates’ Content
Your Content
Affiliated
Content
Your
Content
Licensed
Content
Google Street View Topic 0.90
Google Company 1.00
Ireland Place 0.70
Norway Place 0.70
South Africa Place 0.70
Sweden Place 0.70
Brian McClendon Person 0.80
Mountain View, California Place 0.60
Wi-Fi Topic 0.50
6
Related Content Widgets
6
7
Inform Topic Pages, Micro Sites
7
8
My Job: Building the Semantic Platform…
8
• “Silo”-ed Semantic Technology  Semantic Web
‣ Aligned with Wikipedia, Leverage Linked Data for Mash-Ups
‣ RDFa, SKOS, Semantic SEO
• Semantic / NLP Engine
‣ Improve Features, Quality
• Semantic Data Infrastructure
‣ Scalable Infrastructure
• Semantic Data Analysis
‣ Algorithms (Topology of Graphs), Inference
‣ “PageRank” on semantic data
• Personalization, Usage Analysis
• Micro Sites
‣ Clusters of Topics, Generating Rich Content Experience
• Distributing to Social Platforms
‣ i.e. Facebook
9
Inform: Author to Audience
9
10
Leverage Inform Taxonomy
10
1111
Author 
‣ Content Creation Services
‣ Semantic Data Repository
‣ Semantic Data Analysis
‣ Content Selection Algorithms
‣ Webservices
‣ Content Distribution Services
 Audience
Inside the
Semantic
System
Architecture
12
Content Creation
12
• Article Creation Tool (ACT)
‣ Author Tools
‣ Embed in CMS, Tumblr / Wordpress Plugin
• Publisher Portal
‣ Editorial Tool
‣ Content Feeds
• Web Crawl
• Summarizer
‣ Create smart “blurbs” to advertise article
• LinkedData
‣ Freebase, Wikipedia, DBPedia, et cetera.
13
ACT Tool
13
14
ACT Tool
14
15
ACT Tool, Tumblr, Wordpress
15
16
Publisher Portal
16
17
Summarizer
17
18
Semantic Data Repository
18
• Data Master / Data Node
‣ Federated Semantic Data Managers
‣ SPARQL Triplestore (scalable cluster)
‣ Semantic Search
‣ Search Indexes (Semi-Structured and Full-Text Search)
‣ Lucene/Siren (Sindice)
‣ Facets, Frequency Counts
‣ Cache (In-Memory)
‣ Blob Store (Voldemort)
‣ Listener to Activity (Flume)
‣ User Activity (clicks)
‣ Content Activity (content updates)
‣ Near Real-Time Trends, Analysis
‣ Compute Algorithms (Stored Procedures in Groovy)
‣ Long Term Content Archive (offline)
19
Semantic Data Analysis
19
• Natural Language Processing
‣ Rules & Machine Learning, Training
‣ 500K articles per day, 4,000 unique sites
‣ Text Extraction, Section/Sentence Extraction
‣ Tokenization, Part-of-Speech, Noun/Verb Phrases
‣ Entity Extraction, Entity Normalization
‣ Topic Extraction, Summarization, Clustering
• User Activity
‣ User Model (Personalization)
• Semantic Inference
‣ F-Logic, Multi-Domain
‣ Linked Data Mash-Ups
• Semantic Graph Topology
‣ Entity / Property Importance Metrics, Ranking, “PageRank”
‣ Which triples in LinkedData are interesting?
20
Content Selection Algorithms
20
• Model of User, Personalization
‣ Social Networks provide Context
• Semantic Analysis of Content
• Algorithms
‣ Maximize Relevancy / Relatedness (Meets Editorial Criteria)
‣ Maximize Click-Through
‣ Cute Kitten vs. Engagement Issue
‣ Maximize Monetization
Goal: Content Exchange
21
Webservices
21
• REST
‣ Outputs RDF / JSON Data
• Natural Language Processing
‣ Article to Semantic MetaData
• Related Content
‣ Inputs: Content, Personalization, Algorithm
‣ Articles
‣ Semantic Mash-Ups
‣ Topics
‣ Entities
• Semantic Query, Site Search
• Storage, Content Repository
22
Content Distribution Services
22
• Customer Destinations (Traditional Business)
‣ Deep Integration
• Publisher Widgets
‣ Levels of Lightweight Integration
‣ Example: Related-Content-Widget in JavaScript
• Inform.com
‣ Topic Pages
• Micro Sites
‣ Several Thousand Owned-and-Operated Domains/Sites, Topic Driven
• Social Networks
‣ Facebook
Tools:
• Semantic SEO
‣ RDFa, SKOS
23
Semantic MetaData, RDFa
23
http://inspector.sindice.com
24
Facebook App
24
25
Using Facebook OpenGraph
25
Relevancy Algorithm:
Combine:
•Trending / Popular Topics
•Trending / Popular Articles
•Personalization “Liked” Topics
•Personalization “Liked” Articles
•User Profiles (“Users like you…”)
26
Facebook “Liked” Topics
26
27
Facebook Article Stream
27
28
Inform: Author to Audience via Semantics
28
29
Thanks for your attention!
29
Questions?
Contact Information:
Marc Hadfield
marc@inform.com

More Related Content

What's hot

Structured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for itStructured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for itRichard Wallis
 
Fried data summit data quality data analytics together
Fried data summit data quality data analytics togetherFried data summit data quality data analytics together
Fried data summit data quality data analytics togetherJeff Fried
 
A Real-World Implementation of Linked Data
A Real-World Implementation of Linked DataA Real-World Implementation of Linked Data
A Real-World Implementation of Linked DataDimitri van Hees
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointJoris Poelmans
 
How LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationHow LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationChi-Yi Kuan
 
Focused Crawling for Structured Data
Focused Crawling for Structured DataFocused Crawling for Structured Data
Focused Crawling for Structured DataRobert Meusel
 
Schema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowSchema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowRichard Wallis
 
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...Lucidworks
 
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
Neo4j   graphs in the real world - graph days d.c. - april 14, 2015Neo4j   graphs in the real world - graph days d.c. - april 14, 2015
Neo4j graphs in the real world - graph days d.c. - april 14, 2015Neo4j
 
DWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose ItDWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose ItMarc D Anderson
 
Real-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case studyReal-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case studydeep.bi
 
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesKESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesLinked Enterprise Date Services
 
Understanding voice of the member via text mining
Understanding voice of the member via text miningUnderstanding voice of the member via text mining
Understanding voice of the member via text miningChi-Yi Kuan
 
Instant Security and User Management in Spring Boot
Instant Security and User Management in Spring BootInstant Security and User Management in Spring Boot
Instant Security and User Management in Spring BootRemy Champion
 
S4: The Self-Service Semantic Suite
S4: The Self-Service Semantic SuiteS4: The Self-Service Semantic Suite
S4: The Self-Service Semantic SuiteMarin Dimitrov
 
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...Yongzheng (Tiger) Zhang
 
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Linked Enterprise Date Services
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLDATAVERSITY
 
O365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermottO365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermottNCCOMMS
 
Semantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.comSemantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.comBernhard Schandl
 

What's hot (20)

Structured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for itStructured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for it
 
Fried data summit data quality data analytics together
Fried data summit data quality data analytics togetherFried data summit data quality data analytics together
Fried data summit data quality data analytics together
 
A Real-World Implementation of Linked Data
A Real-World Implementation of Linked DataA Real-World Implementation of Linked Data
A Real-World Implementation of Linked Data
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePoint
 
How LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationHow LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data Visualization
 
Focused Crawling for Structured Data
Focused Crawling for Structured DataFocused Crawling for Structured Data
Focused Crawling for Structured Data
 
Schema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowSchema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & How
 
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...
 
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
Neo4j   graphs in the real world - graph days d.c. - april 14, 2015Neo4j   graphs in the real world - graph days d.c. - april 14, 2015
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
 
DWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose ItDWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose It
 
Real-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case studyReal-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case study
 
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesKESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
 
Understanding voice of the member via text mining
Understanding voice of the member via text miningUnderstanding voice of the member via text mining
Understanding voice of the member via text mining
 
Instant Security and User Management in Spring Boot
Instant Security and User Management in Spring BootInstant Security and User Management in Spring Boot
Instant Security and User Management in Spring Boot
 
S4: The Self-Service Semantic Suite
S4: The Self-Service Semantic SuiteS4: The Self-Service Semantic Suite
S4: The Self-Service Semantic Suite
 
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
 
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
 
O365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermottO365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermott
 
Semantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.comSemantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.com
 

Similar to Building the Inform Semantic Publishing Ecosystem: from Author to Audience

2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v12017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1Don Miller
 
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case StudyOracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case StudyMark Rittman
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us? Andrea Volpini
 
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Mark Rittman
 
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 ConferenceStructuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 ConferenceNick Inglis
 
Climbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations WebinarClimbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations WebinarConcept Searching, Inc
 
Big problems Big data, simple AWS solution
Big problems Big data, simple AWS solutionBig problems Big data, simple AWS solution
Big problems Big data, simple AWS solutionJean-Claude Sotto
 
MLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceMLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceKarl Seiler
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...Dr. Haxel Consult
 
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Open Analytics
 
Open Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenOpen Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenChristopher Whitaker
 
Social Media Data Collection & Analysis
Social Media Data Collection & AnalysisSocial Media Data Collection & Analysis
Social Media Data Collection & AnalysisScott Sanders
 
Big problems Big Data, simple solutions
Big problems Big Data, simple solutionsBig problems Big Data, simple solutions
Big problems Big Data, simple solutionsClaudio Pontili
 
The Next Web of Linked Data
The Next Web of Linked DataThe Next Web of Linked Data
The Next Web of Linked DataJay Myers
 
How to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationHow to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationPerficient, Inc.
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopPeter Skomoroch
 

Similar to Building the Inform Semantic Publishing Ecosystem: from Author to Audience (20)

Semantics and Machine Learning
Semantics and Machine LearningSemantics and Machine Learning
Semantics and Machine Learning
 
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v12017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
 
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case StudyOracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
 
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-PremiseWebinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us?
 
Webinar: Slippery Slope of SharePoint Migrations
Webinar: Slippery Slope of SharePoint Migrations Webinar: Slippery Slope of SharePoint Migrations
Webinar: Slippery Slope of SharePoint Migrations
 
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
 
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 ConferenceStructuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
 
Climbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations WebinarClimbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations Webinar
 
Big problems Big data, simple AWS solution
Big problems Big data, simple AWS solutionBig problems Big data, simple AWS solution
Big problems Big data, simple AWS solution
 
MLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceMLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a Service
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
 
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
 
Open Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenOpen Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe Olsen
 
Social Media Data Collection & Analysis
Social Media Data Collection & AnalysisSocial Media Data Collection & Analysis
Social Media Data Collection & Analysis
 
Big problems Big Data, simple solutions
Big problems Big Data, simple solutionsBig problems Big Data, simple solutions
Big problems Big Data, simple solutions
 
The Next Web of Linked Data
The Next Web of Linked DataThe Next Web of Linked Data
The Next Web of Linked Data
 
Semantic Web For Dummies
Semantic Web For DummiesSemantic Web For Dummies
Semantic Web For Dummies
 
How to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationHow to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data Visualization
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With Hadoop
 

More from Vital.AI

Optimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data ScienceOptimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data ScienceVital.AI
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital.AI
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data ModelingVital.AI
 
Vital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI
 
Natural Language Processing & Semantic Models in an Imperfect World
Natural Language Processing & Semantic Modelsin an Imperfect WorldNatural Language Processing & Semantic Modelsin an Imperfect World
Natural Language Processing & Semantic Models in an Imperfect WorldVital.AI
 
Inform: Targeting the Interest Graph
Inform: Targeting the Interest GraphInform: Targeting the Interest Graph
Inform: Targeting the Interest GraphVital.AI
 

More from Vital.AI (6)

Optimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data ScienceOptimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data Science
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data Modeling
 
Vital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent Apps
 
Natural Language Processing & Semantic Models in an Imperfect World
Natural Language Processing & Semantic Modelsin an Imperfect WorldNatural Language Processing & Semantic Modelsin an Imperfect World
Natural Language Processing & Semantic Models in an Imperfect World
 
Inform: Targeting the Interest Graph
Inform: Targeting the Interest GraphInform: Targeting the Interest Graph
Inform: Targeting the Interest Graph
 

Recently uploaded

Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 

Recently uploaded (20)

Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 

Building the Inform Semantic Publishing Ecosystem: from Author to Audience

  • 1. 1 More Meaning. Better Results. 1 Building the Inform Semantic Publishing Ecosystem: from Author to Audience Marc Hadfield VP, Research & Development marc@inform.com
  • 2. 2 Marc Hadfield • Semantic Technology, Computer Science • Inform Technologies (Head of R&D) ‣ Semantic Technologies applied to Content Analysis & Distribution • Alitora Systems (Co-Founder / CTO) ‣ Life Science Semantic Technology, Research, Big Data Analytics, Semantic HPC ‣ Life Science Natural Language Processing • Columbia Genome Center ‣ NLP applied to Life Science Research Articles • LCconnect (CTO) ‣ Letter-of-Credit Exchange 2
  • 3. 3 Semantics in Publishing… 3 • Ongoing Theme at ISWC 2010… ‣ NY Times ‣ Facebook (OpenGraph) ‣ Elsevier ‣ BBC
  • 4. 4 What is Inform? 4 • Inform is a content enrichment solution designed to increase consumer engagement, page views and revenue. • We provide a hosted Semantic Web Service for content publishers that: 1. Reads your article before you publish it 2. Turns main topics and entities (people, places, companies, organizations) into links 3. Provides feeds of related web content when you publish it • New Direction: Optimizing Content Distribution via Direct Channels • Web users moving away from destination web sites, but still want the destination web site content. • Companies utilizing Inform include:
  • 5. Connecting your content 55 Audio, Video & Blogs from the Web Articles from the Web Content from Inform Your Affiliates’ Content Your Content Affiliated Content Your Content Licensed Content Google Street View Topic 0.90 Google Company 1.00 Ireland Place 0.70 Norway Place 0.70 South Africa Place 0.70 Sweden Place 0.70 Brian McClendon Person 0.80 Mountain View, California Place 0.60 Wi-Fi Topic 0.50
  • 7. 7 Inform Topic Pages, Micro Sites 7
  • 8. 8 My Job: Building the Semantic Platform… 8 • “Silo”-ed Semantic Technology  Semantic Web ‣ Aligned with Wikipedia, Leverage Linked Data for Mash-Ups ‣ RDFa, SKOS, Semantic SEO • Semantic / NLP Engine ‣ Improve Features, Quality • Semantic Data Infrastructure ‣ Scalable Infrastructure • Semantic Data Analysis ‣ Algorithms (Topology of Graphs), Inference ‣ “PageRank” on semantic data • Personalization, Usage Analysis • Micro Sites ‣ Clusters of Topics, Generating Rich Content Experience • Distributing to Social Platforms ‣ i.e. Facebook
  • 9. 9 Inform: Author to Audience 9
  • 11. 1111 Author  ‣ Content Creation Services ‣ Semantic Data Repository ‣ Semantic Data Analysis ‣ Content Selection Algorithms ‣ Webservices ‣ Content Distribution Services  Audience Inside the Semantic System Architecture
  • 12. 12 Content Creation 12 • Article Creation Tool (ACT) ‣ Author Tools ‣ Embed in CMS, Tumblr / Wordpress Plugin • Publisher Portal ‣ Editorial Tool ‣ Content Feeds • Web Crawl • Summarizer ‣ Create smart “blurbs” to advertise article • LinkedData ‣ Freebase, Wikipedia, DBPedia, et cetera.
  • 15. 15 ACT Tool, Tumblr, Wordpress 15
  • 18. 18 Semantic Data Repository 18 • Data Master / Data Node ‣ Federated Semantic Data Managers ‣ SPARQL Triplestore (scalable cluster) ‣ Semantic Search ‣ Search Indexes (Semi-Structured and Full-Text Search) ‣ Lucene/Siren (Sindice) ‣ Facets, Frequency Counts ‣ Cache (In-Memory) ‣ Blob Store (Voldemort) ‣ Listener to Activity (Flume) ‣ User Activity (clicks) ‣ Content Activity (content updates) ‣ Near Real-Time Trends, Analysis ‣ Compute Algorithms (Stored Procedures in Groovy) ‣ Long Term Content Archive (offline)
  • 19. 19 Semantic Data Analysis 19 • Natural Language Processing ‣ Rules & Machine Learning, Training ‣ 500K articles per day, 4,000 unique sites ‣ Text Extraction, Section/Sentence Extraction ‣ Tokenization, Part-of-Speech, Noun/Verb Phrases ‣ Entity Extraction, Entity Normalization ‣ Topic Extraction, Summarization, Clustering • User Activity ‣ User Model (Personalization) • Semantic Inference ‣ F-Logic, Multi-Domain ‣ Linked Data Mash-Ups • Semantic Graph Topology ‣ Entity / Property Importance Metrics, Ranking, “PageRank” ‣ Which triples in LinkedData are interesting?
  • 20. 20 Content Selection Algorithms 20 • Model of User, Personalization ‣ Social Networks provide Context • Semantic Analysis of Content • Algorithms ‣ Maximize Relevancy / Relatedness (Meets Editorial Criteria) ‣ Maximize Click-Through ‣ Cute Kitten vs. Engagement Issue ‣ Maximize Monetization Goal: Content Exchange
  • 21. 21 Webservices 21 • REST ‣ Outputs RDF / JSON Data • Natural Language Processing ‣ Article to Semantic MetaData • Related Content ‣ Inputs: Content, Personalization, Algorithm ‣ Articles ‣ Semantic Mash-Ups ‣ Topics ‣ Entities • Semantic Query, Site Search • Storage, Content Repository
  • 22. 22 Content Distribution Services 22 • Customer Destinations (Traditional Business) ‣ Deep Integration • Publisher Widgets ‣ Levels of Lightweight Integration ‣ Example: Related-Content-Widget in JavaScript • Inform.com ‣ Topic Pages • Micro Sites ‣ Several Thousand Owned-and-Operated Domains/Sites, Topic Driven • Social Networks ‣ Facebook Tools: • Semantic SEO ‣ RDFa, SKOS
  • 25. 25 Using Facebook OpenGraph 25 Relevancy Algorithm: Combine: •Trending / Popular Topics •Trending / Popular Articles •Personalization “Liked” Topics •Personalization “Liked” Articles •User Profiles (“Users like you…”)
  • 28. 28 Inform: Author to Audience via Semantics 28
  • 29. 29 Thanks for your attention! 29 Questions? Contact Information: Marc Hadfield marc@inform.com