SlideShare a Scribd company logo
1 of 22
Download to read offline
Enterprise search – how relevant is ‘relevance’?
Martin White, Managing Director, Intranet Focus Ltd
Visiting Professor, Information School, University of Sheffield
Meetup London 23 June 2020
Martin.white@intranetfocus.com
@intranetfocus
@Intranetfocus 1
My search history 1975 – 2020 (and beyond..!)
2
1975
1999
2002 - 2020
DECO
Unilever Computing Services
1982
http://intranetfocus.com/resources/reports/
@Intranetfocus 3
https://unsplash.com/photos/4zbEzE87zz4
What do we mean by search?
• WWW search
• Massive volumes of everything dominated by Google and the need for advertising revenue
• Extensive use of SEO curation of content to improve findability
• Inadvertently sets the ‘benchmark’ for enterprise search “Why can’t our search be like Google?’
• Web site/intranet search
• Highly curated content with a focus on browsing
• Search usually ends up as an afterthought
• Significant resources devoted to ‘being found’
• Repository search
• Services for (primarily) academic users with highly curated content on a library/archive management system
• Also extensive use of external search applications such as Scopus and Lexis
• Precision is important
• E-Commerce search
• Highly curated content and very good user tracking metrics
• Very clear RoI based on sales revenues and repeat customers
@Intranetfocus 4
Enterprise search
• Massive amounts of structured and unstructured content, very little of it curated and dominated by
Microsoft 365 Search in terms of installed base and ‘best practice’
• Often multiple applications in a federated search implementation
• Multiple content languages and unpredictable language skills of users
• Security trimming plays havoc with relevance
• Users are experts in their domain, and have a substantial about of information pushed to them by other
enterprise applications (ERP, CRM, HR) and of course email and social media
• Search is therefore ‘additive’ and not from a zero knowledge base
• Not based around business processes so it is very difficult to assess whether a search failure is a content
issue, a query issue or a technology issue
• Rarely is there any internal content/user focused team. Largely managed by IT on ‘technical performance’
(uptime/traffic) metrics
• Assumption is that search is intuitive, so no training and no mentoring available
@Intranetfocus 5
@Intranetfocus 6
https://rgu-repository.worktribe.com/output/249111https://www.clearbox.co.uk/diagnosing-enterprise-search/
Classic IR/IIR evaluation assumptions
• Access to a representative test collection
• Selection of a representative user cohort (all too often students!)
• Fluency in query and content languages
• No (or little) prior subject knowledge assumed of searchers
• No (or little) justification for undertaking the search
• No business imperative (or buy in) to find the information
• A/B tests used on a test collection to assess ranking improvement
• Relevance assessed (usually) on item title and snippet
• No assessment of content quality
@Intranetfocus 7
Information seeking – some numbers
@Intranetfocus 8
0 10 20 30 40 50 60 70 80 90
Information service
Electronic closing binders
Internal specialists
Enterprise search
Library books
Knowledge database
Library staff
Electronic research services
DM applications
Intranet
Global law firm – approx. 1000 respondents out of 2000 staff
Intranet Focus client
% using each option on a ‘very frequent’ or ‘frequent’ basis
Managing Babel
@Intranetfocus 9
The state of the art
@Intranetfocus 10
© 2020 IntraTeam Digital Workplace
Benchmarking Service
www.intrateam.com
Findwise Enterprise Findability Survey 2018
https://findwise.com/en/enterprise-search
Impact of search tasks on search performance
@Intranetfocus 11
The qualitative content analysis of work-task journal study
material pointed on the following work task scenarios: (1)
ordinary and (2) unordinary administrative tasks; everyday
professional tasks as (3) high-quality tasks, (4) “just-to-get-
done” tasks and (5) regular teamwork; and unordinary
professional tasks as (6) unique tasks and (7) inventive
teamwork
The impression we have, is that the majority of current
IIR research centres on Internet searching and everyday-
life information needs. ….However, there remains a need
for IIR research on information searching in relation to
information intensive work task performance with
respect to optimise information searching, the various
platforms used for information searching, and
understanding of the conditions under which work task
performance takes place.
https://drive.google.com/file/d/1BosRT0sDMvVjPxLCR4dpxuYT9ip-FSWW/view
2019 Strix Lecture given by Pia Borlund on search and task completion
Foraging for information
@Intranetfocus 12
Information patch
Information scent
Information diet
Complex Searcher Model
Information Scent, Searching and Stopping. Modelling SERP Level
Stopping Behaviour David Maxwell and Leif Azzopardi
https://www.cmswire.com/information-management/enterprise-search-development-start-
with-the-user-interface/
Stopping strategies
@Intranetfocus 14
Increase in user effort
Increase in
search success Stopping point
"I was expecting to see Document X on
the first couple of pages. It’s not there/
How can I trust the search application?"
"Five irrelevant results one after
another. I'll give John a call. He'll know
the answer"
"I'm on p3 of the results and I've
only seen one decent result and
that's a document I already have"
"I've found the French version of the
report. Where is the English master
version? I'm wasting my time"
Professional search – use cases are everything
@Intranetfocus 15
Information Management and Processing
5 (2018) 1042-1057
https://strathprints.strath.ac.uk/64935/ (open access version)
Synonyms
Boolean
Abbreviations
HealthcareLegal
Looking at click logs
@Intranetfocus 16
72,000 employees
From #150 – #500 average queries per month 200
box 83338 3
BOX 15526 26
Box 7276 76 106,140
concur 50577 5
Concur 8100 67
concur 11914 38 70,591
hr 15541 25
HR 9929 46 25,470
lti 9197 53
LTI 9101 56 18,298
my hr 22011 15
myhr 11233 43 33,244
ocm 19446 18
OCM 7407 74 26,853
saba 22667 14
saba cloud 13700 32 36,367
software 12715 35
Software 9455 49
software store 14453 29 36,623
workday 84380 2
Workday 8981 57 93,361
it 8287 65 15,776
IT 7489 70
kiosk 14120 30
Kiosk 6858 81 20,978
angel 36133 9
ANGEL 8557 61 44,690
Three use cases
@Intranetfocus 17
Use Case A
Quick access route to applications, people, office locations,
breaking news etc
Often ‘clever’ single query terms
Use Case B
Checking on corporate policies to find the most recent version,
Corporate/divisional news archive
Products, initiatives and projects
Query terms include date ranges/alphanumeric terms
Use Case C
Topic-related searches using well-considered query terms
Often periodic (i.e. monthly) or cyclic (‘end-of-year’) searches
Sum of Top 100 Searches 1,563,384
Sum of Next 400 1,389,782
‘Precision’ ‘Recall’
How relevant is relevance?
• Enterprise search as users will not click through each result in sequence as they
are using a range of clues to work out what is relevant.
• Ascertaining ‘intent’ is very difficult as people have multiple roles, multiple use-
cases and unequal and unknown prior knowledge
• It could be looking for an author that is in the same building so that they can go at
talk to them about a problem they have. The document search was actually to
find a local expert. The document may otherwise have been ‘irrelevant’
• The user may spot a very relevant document at say #3 but will not click on it
because they already have it. The presence of the document at #3 is a
reassurance that they are on the right track.
• These behaviours make it very challenging to work out what set of heuristics the
user is employing to assess each result and so what is ‘relevant’
@Intranetfocus 18
Evaluating enterprise search
1. Click logs have a value, but for every query term ask “Why?”
2. “Time well spent” http://intranetfocus.com/time-well-spent-a-potential-holistic-view-of-productivity/
3. Computational ethnography using data logging
4. Constant usability testing around tasks and task completion
5. Repetition (also constant) of high hit and low/zero hit searches
6. Embedded search mentors, especially in project teams
7. Clever surveys
8. Story telling – “how was it for you?”
9. Spot checks
10. Heuristic benchmarking (SearchCheck http://intranetfocus.com/enterprise-search-consulting-services/)
@Intranetfocus 19
Summary
• The primary search mode in the enterprise is information foraging, where users
apply a very personal array of heuristics to assessing relevance
• ‘Relevance’ is still relevant but has to be associated with specific use cases and
with a range of other search and query measurements
• Content quality, snippet quality and the extent of search expertise play very
important roles in search success and search satisfaction
• Security trimming can have a significant impact on ranking which is impossible to
allow for
• The complexity of enterprise search implementations is why having a skilled
multi-disciplinary search team is essential
@Intranetfocus 20
Martin.white@intranetfocus.com
Time for reflection
Further reading
@Intranetfocus 22
Enterprise Search (O’Reilly Media)
Achieving Enterprise Search Satisfaction http://intranetfocus.com/resources/reports/
Recent columns
Seven stress tests for your enterprise search and intranet search applications
Distributed information management – the oxygen of your organisation
Unpacking the complexities of enterprise search behaviour
Search won’t improve until we understand why people search, not just how
Diving into enterprise search query logs
When improving search performance don’t follow the clicks
Time spent searching – a chronology of the myth and some recent research
A list of twenty enterprise search myths
http://theses.gla.ac.uk/41132/

More Related Content

What's hot

Search Quality Evaluation to Help Reproducibility : an Open Source Approach
Search Quality Evaluation to Help Reproducibility : an Open Source ApproachSearch Quality Evaluation to Help Reproducibility : an Open Source Approach
Search Quality Evaluation to Help Reproducibility : an Open Source ApproachAlessandro Benedetti
 
Search Quality Evaluation: a Developer Perspective
Search Quality Evaluation: a Developer PerspectiveSearch Quality Evaluation: a Developer Perspective
Search Quality Evaluation: a Developer PerspectiveAndrea Gazzarini
 
Rated Ranking Evaluator (FOSDEM 2019)
Rated Ranking Evaluator (FOSDEM 2019)Rated Ranking Evaluator (FOSDEM 2019)
Rated Ranking Evaluator (FOSDEM 2019)Andrea Gazzarini
 
A Learning to Rank Project on a Daily Song Ranking Problem
A Learning to Rank Project on a Daily Song Ranking ProblemA Learning to Rank Project on a Daily Song Ranking Problem
A Learning to Rank Project on a Daily Song Ranking ProblemSease
 
Haystack London - Search Quality Evaluation, Tools and Techniques
Haystack London - Search Quality Evaluation, Tools and Techniques Haystack London - Search Quality Evaluation, Tools and Techniques
Haystack London - Search Quality Evaluation, Tools and Techniques Andrea Gazzarini
 
Interactive Questions and Answers - London Information Retrieval Meetup
Interactive Questions and Answers - London Information Retrieval MeetupInteractive Questions and Answers - London Information Retrieval Meetup
Interactive Questions and Answers - London Information Retrieval MeetupSease
 
From Academic Papers To Production : A Learning To Rank Story
From Academic Papers To Production : A Learning To Rank StoryFrom Academic Papers To Production : A Learning To Rank Story
From Academic Papers To Production : A Learning To Rank StoryAlessandro Benedetti
 
Explainability for Learning to Rank
Explainability for Learning to RankExplainability for Learning to Rank
Explainability for Learning to RankSease
 
Let's Build an Inverted Index: Introduction to Apache Lucene/Solr
Let's Build an Inverted Index: Introduction to Apache Lucene/SolrLet's Build an Inverted Index: Introduction to Apache Lucene/Solr
Let's Build an Inverted Index: Introduction to Apache Lucene/SolrSease
 
Advanced Document Similarity With Apache Lucene
Advanced Document Similarity With Apache LuceneAdvanced Document Similarity With Apache Lucene
Advanced Document Similarity With Apache LuceneAlessandro Benedetti
 
How the Lucene More Like This Works
How the Lucene More Like This WorksHow the Lucene More Like This Works
How the Lucene More Like This WorksSease
 
Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market Xun Wang
 
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Lucidworks
 
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough data
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough dataВладимир Гулин, Mail.Ru Group, Learning to rank using clickthrough data
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough dataMail.ru Group
 
Interleaving, Evaluation to Self-learning Search @904Labs
Interleaving, Evaluation to Self-learning Search @904LabsInterleaving, Evaluation to Self-learning Search @904Labs
Interleaving, Evaluation to Self-learning Search @904LabsJohn T. Kane
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Lucidworks
 
Crowdsourced query augmentation through the semantic discovery of domain spec...
Crowdsourced query augmentation through the semantic discovery of domain spec...Crowdsourced query augmentation through the semantic discovery of domain spec...
Crowdsourced query augmentation through the semantic discovery of domain spec...Trey Grainger
 

What's hot (18)

Search Quality Evaluation to Help Reproducibility : an Open Source Approach
Search Quality Evaluation to Help Reproducibility : an Open Source ApproachSearch Quality Evaluation to Help Reproducibility : an Open Source Approach
Search Quality Evaluation to Help Reproducibility : an Open Source Approach
 
Search Quality Evaluation: a Developer Perspective
Search Quality Evaluation: a Developer PerspectiveSearch Quality Evaluation: a Developer Perspective
Search Quality Evaluation: a Developer Perspective
 
Rated Ranking Evaluator (FOSDEM 2019)
Rated Ranking Evaluator (FOSDEM 2019)Rated Ranking Evaluator (FOSDEM 2019)
Rated Ranking Evaluator (FOSDEM 2019)
 
A Learning to Rank Project on a Daily Song Ranking Problem
A Learning to Rank Project on a Daily Song Ranking ProblemA Learning to Rank Project on a Daily Song Ranking Problem
A Learning to Rank Project on a Daily Song Ranking Problem
 
Haystack London - Search Quality Evaluation, Tools and Techniques
Haystack London - Search Quality Evaluation, Tools and Techniques Haystack London - Search Quality Evaluation, Tools and Techniques
Haystack London - Search Quality Evaluation, Tools and Techniques
 
Interactive Questions and Answers - London Information Retrieval Meetup
Interactive Questions and Answers - London Information Retrieval MeetupInteractive Questions and Answers - London Information Retrieval Meetup
Interactive Questions and Answers - London Information Retrieval Meetup
 
From Academic Papers To Production : A Learning To Rank Story
From Academic Papers To Production : A Learning To Rank StoryFrom Academic Papers To Production : A Learning To Rank Story
From Academic Papers To Production : A Learning To Rank Story
 
Explainability for Learning to Rank
Explainability for Learning to RankExplainability for Learning to Rank
Explainability for Learning to Rank
 
Let's Build an Inverted Index: Introduction to Apache Lucene/Solr
Let's Build an Inverted Index: Introduction to Apache Lucene/SolrLet's Build an Inverted Index: Introduction to Apache Lucene/Solr
Let's Build an Inverted Index: Introduction to Apache Lucene/Solr
 
Advanced Document Similarity With Apache Lucene
Advanced Document Similarity With Apache LuceneAdvanced Document Similarity With Apache Lucene
Advanced Document Similarity With Apache Lucene
 
How the Lucene More Like This Works
How the Lucene More Like This WorksHow the Lucene More Like This Works
How the Lucene More Like This Works
 
Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market Haystack- Learning to rank in an hourly job market
Haystack- Learning to rank in an hourly job market
 
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
 
Haystacks slides
Haystacks slidesHaystacks slides
Haystacks slides
 
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough data
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough dataВладимир Гулин, Mail.Ru Group, Learning to rank using clickthrough data
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough data
 
Interleaving, Evaluation to Self-learning Search @904Labs
Interleaving, Evaluation to Self-learning Search @904LabsInterleaving, Evaluation to Self-learning Search @904Labs
Interleaving, Evaluation to Self-learning Search @904Labs
 
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
Leveraging Lucene/Solr as a Knowledge Graph and Intent Engine: Presented by T...
 
Crowdsourced query augmentation through the semantic discovery of domain spec...
Crowdsourced query augmentation through the semantic discovery of domain spec...Crowdsourced query augmentation through the semantic discovery of domain spec...
Crowdsourced query augmentation through the semantic discovery of domain spec...
 

Similar to Enterprise Search – How Relevant Is Relevance?

Introduction to Competitive Intelligence Portals
Introduction to Competitive Intelligence PortalsIntroduction to Competitive Intelligence Portals
Introduction to Competitive Intelligence PortalsComintelli
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...Concept Searching, Inc
 
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Findwise
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findabilityKristian Norling
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientistPoo Kuan Hoong
 
O365 Tools for Building a Digital Workplace
O365 Tools for Building a Digital WorkplaceO365 Tools for Building a Digital Workplace
O365 Tools for Building a Digital WorkplaceJeff Fried
 
How to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarHow to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarConcept Searching, Inc
 
Intranets and hybrid search
Intranets and hybrid searchIntranets and hybrid search
Intranets and hybrid searchIntranätverk
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise SearchFindwise
 
CAREERTEAM - NOAH19 Berlin
CAREERTEAM - NOAH19 BerlinCAREERTEAM - NOAH19 Berlin
CAREERTEAM - NOAH19 BerlinNOAH Advisors
 
The evolution of Search spscinci
The evolution of Search spscinciThe evolution of Search spscinci
The evolution of Search spscinciJohnny Lopez
 
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Concept Searching, Inc
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for FindabilityFindwise
 
How to Evolve Intelligence Organizations for Maximum Success
How to Evolve Intelligence Organizations for Maximum SuccessHow to Evolve Intelligence Organizations for Maximum Success
How to Evolve Intelligence Organizations for Maximum SuccessIntelCollab.com
 
How to Evolve Intelligence Organizations for Maximum Success
How to Evolve Intelligence Organizations for Maximum SuccessHow to Evolve Intelligence Organizations for Maximum Success
How to Evolve Intelligence Organizations for Maximum SuccessArik Johnson
 
CREATE SEARCH DRIVEN BUSINESS INTELLIGENCE APPLICATION USING FAST SEARCH FO...
CREATE SEARCH DRIVEN BUSINESS  INTELLIGENCE APPLICATION USING  FAST SEARCH FO...CREATE SEARCH DRIVEN BUSINESS  INTELLIGENCE APPLICATION USING  FAST SEARCH FO...
CREATE SEARCH DRIVEN BUSINESS INTELLIGENCE APPLICATION USING FAST SEARCH FO...Netwoven Inc.
 
eDiscovery at Nottinghamshire County Council
eDiscovery at Nottinghamshire County Council eDiscovery at Nottinghamshire County Council
eDiscovery at Nottinghamshire County Council Concept Searching, Inc
 
Why Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance WebinarWhy Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance WebinarConcept Searching, Inc
 

Similar to Enterprise Search – How Relevant Is Relevance? (20)

FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010
 
Introduction to Competitive Intelligence Portals
Introduction to Competitive Intelligence PortalsIntroduction to Competitive Intelligence Portals
Introduction to Competitive Intelligence Portals
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
 
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
 
The path to be a data scientist
The path to be a data scientistThe path to be a data scientist
The path to be a data scientist
 
O365 Tools for Building a Digital Workplace
O365 Tools for Building a Digital WorkplaceO365 Tools for Building a Digital Workplace
O365 Tools for Building a Digital Workplace
 
How to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarHow to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right Webinar
 
Intranets and hybrid search
Intranets and hybrid searchIntranets and hybrid search
Intranets and hybrid search
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise Search
 
CAREERTEAM - NOAH19 Berlin
CAREERTEAM - NOAH19 BerlinCAREERTEAM - NOAH19 Berlin
CAREERTEAM - NOAH19 Berlin
 
The evolution of Search spscinci
The evolution of Search spscinciThe evolution of Search spscinci
The evolution of Search spscinci
 
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for Findability
 
How to Evolve Intelligence Organizations for Maximum Success
How to Evolve Intelligence Organizations for Maximum SuccessHow to Evolve Intelligence Organizations for Maximum Success
How to Evolve Intelligence Organizations for Maximum Success
 
How to Evolve Intelligence Organizations for Maximum Success
How to Evolve Intelligence Organizations for Maximum SuccessHow to Evolve Intelligence Organizations for Maximum Success
How to Evolve Intelligence Organizations for Maximum Success
 
Slalmd2014 cid presentation
Slalmd2014 cid presentationSlalmd2014 cid presentation
Slalmd2014 cid presentation
 
CREATE SEARCH DRIVEN BUSINESS INTELLIGENCE APPLICATION USING FAST SEARCH FO...
CREATE SEARCH DRIVEN BUSINESS  INTELLIGENCE APPLICATION USING  FAST SEARCH FO...CREATE SEARCH DRIVEN BUSINESS  INTELLIGENCE APPLICATION USING  FAST SEARCH FO...
CREATE SEARCH DRIVEN BUSINESS INTELLIGENCE APPLICATION USING FAST SEARCH FO...
 
eDiscovery at Nottinghamshire County Council
eDiscovery at Nottinghamshire County Council eDiscovery at Nottinghamshire County Council
eDiscovery at Nottinghamshire County Council
 
Why Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance WebinarWhy Metadata Matters in SharePoint Search and Information Governance Webinar
Why Metadata Matters in SharePoint Search and Information Governance Webinar
 

More from Sease

Multi Valued Vectors Lucene
Multi Valued Vectors LuceneMulti Valued Vectors Lucene
Multi Valued Vectors LuceneSease
 
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...Sease
 
How To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With KibanaHow To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With KibanaSease
 
Introducing Multi Valued Vectors Fields in Apache Lucene
Introducing Multi Valued Vectors Fields in Apache LuceneIntroducing Multi Valued Vectors Fields in Apache Lucene
Introducing Multi Valued Vectors Fields in Apache LuceneSease
 
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...Sease
 
How does ChatGPT work: an Information Retrieval perspective
How does ChatGPT work: an Information Retrieval perspectiveHow does ChatGPT work: an Information Retrieval perspective
How does ChatGPT work: an Information Retrieval perspectiveSease
 
How To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With KibanaHow To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With KibanaSease
 
Neural Search Comes to Apache Solr
Neural Search Comes to Apache SolrNeural Search Comes to Apache Solr
Neural Search Comes to Apache SolrSease
 
Large Scale Indexing
Large Scale IndexingLarge Scale Indexing
Large Scale IndexingSease
 
Dense Retrieval with Apache Solr Neural Search.pdf
Dense Retrieval with Apache Solr Neural Search.pdfDense Retrieval with Apache Solr Neural Search.pdf
Dense Retrieval with Apache Solr Neural Search.pdfSease
 
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...Sease
 
Word2Vec model to generate synonyms on the fly in Apache Lucene.pdf
Word2Vec model to generate synonyms on the fly in Apache Lucene.pdfWord2Vec model to generate synonyms on the fly in Apache Lucene.pdf
Word2Vec model to generate synonyms on the fly in Apache Lucene.pdfSease
 
How to cache your searches_ an open source implementation.pptx
How to cache your searches_ an open source implementation.pptxHow to cache your searches_ an open source implementation.pptx
How to cache your searches_ an open source implementation.pptxSease
 
Online Testing Learning to Rank with Solr Interleaving
Online Testing Learning to Rank with Solr InterleavingOnline Testing Learning to Rank with Solr Interleaving
Online Testing Learning to Rank with Solr InterleavingSease
 
Apache Lucene/Solr Document Classification
Apache Lucene/Solr Document ClassificationApache Lucene/Solr Document Classification
Apache Lucene/Solr Document ClassificationSease
 
Advanced Document Similarity with Apache Lucene
Advanced Document Similarity with Apache LuceneAdvanced Document Similarity with Apache Lucene
Advanced Document Similarity with Apache LuceneSease
 
Search Quality Evaluation: a Developer Perspective
Search Quality Evaluation: a Developer PerspectiveSearch Quality Evaluation: a Developer Perspective
Search Quality Evaluation: a Developer PerspectiveSease
 
Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information RetrievalSease
 
Rated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: an Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: an Open Source Approach for Search Quality EvaluationSease
 
Feature Extraction for Large-Scale Text Collections
Feature Extraction for Large-Scale Text CollectionsFeature Extraction for Large-Scale Text Collections
Feature Extraction for Large-Scale Text CollectionsSease
 

More from Sease (20)

Multi Valued Vectors Lucene
Multi Valued Vectors LuceneMulti Valued Vectors Lucene
Multi Valued Vectors Lucene
 
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
When SDMX meets AI-Leveraging Open Source LLMs To Make Official Statistics Mo...
 
How To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With KibanaHow To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With Kibana
 
Introducing Multi Valued Vectors Fields in Apache Lucene
Introducing Multi Valued Vectors Fields in Apache LuceneIntroducing Multi Valued Vectors Fields in Apache Lucene
Introducing Multi Valued Vectors Fields in Apache Lucene
 
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
Stat-weight Improving the Estimator of Interleaved Methods Outcomes with Stat...
 
How does ChatGPT work: an Information Retrieval perspective
How does ChatGPT work: an Information Retrieval perspectiveHow does ChatGPT work: an Information Retrieval perspective
How does ChatGPT work: an Information Retrieval perspective
 
How To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With KibanaHow To Implement Your Online Search Quality Evaluation With Kibana
How To Implement Your Online Search Quality Evaluation With Kibana
 
Neural Search Comes to Apache Solr
Neural Search Comes to Apache SolrNeural Search Comes to Apache Solr
Neural Search Comes to Apache Solr
 
Large Scale Indexing
Large Scale IndexingLarge Scale Indexing
Large Scale Indexing
 
Dense Retrieval with Apache Solr Neural Search.pdf
Dense Retrieval with Apache Solr Neural Search.pdfDense Retrieval with Apache Solr Neural Search.pdf
Dense Retrieval with Apache Solr Neural Search.pdf
 
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...
Neural Search Comes to Apache Solr_ Approximate Nearest Neighbor, BERT and Mo...
 
Word2Vec model to generate synonyms on the fly in Apache Lucene.pdf
Word2Vec model to generate synonyms on the fly in Apache Lucene.pdfWord2Vec model to generate synonyms on the fly in Apache Lucene.pdf
Word2Vec model to generate synonyms on the fly in Apache Lucene.pdf
 
How to cache your searches_ an open source implementation.pptx
How to cache your searches_ an open source implementation.pptxHow to cache your searches_ an open source implementation.pptx
How to cache your searches_ an open source implementation.pptx
 
Online Testing Learning to Rank with Solr Interleaving
Online Testing Learning to Rank with Solr InterleavingOnline Testing Learning to Rank with Solr Interleaving
Online Testing Learning to Rank with Solr Interleaving
 
Apache Lucene/Solr Document Classification
Apache Lucene/Solr Document ClassificationApache Lucene/Solr Document Classification
Apache Lucene/Solr Document Classification
 
Advanced Document Similarity with Apache Lucene
Advanced Document Similarity with Apache LuceneAdvanced Document Similarity with Apache Lucene
Advanced Document Similarity with Apache Lucene
 
Search Quality Evaluation: a Developer Perspective
Search Quality Evaluation: a Developer PerspectiveSearch Quality Evaluation: a Developer Perspective
Search Quality Evaluation: a Developer Perspective
 
Introduction to Music Information Retrieval
Introduction to Music Information RetrievalIntroduction to Music Information Retrieval
Introduction to Music Information Retrieval
 
Rated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: an Open Source Approach for Search Quality EvaluationRated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
Rated Ranking Evaluator: an Open Source Approach for Search Quality Evaluation
 
Feature Extraction for Large-Scale Text Collections
Feature Extraction for Large-Scale Text CollectionsFeature Extraction for Large-Scale Text Collections
Feature Extraction for Large-Scale Text Collections
 

Recently uploaded

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 

Recently uploaded (20)

08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 

Enterprise Search – How Relevant Is Relevance?

  • 1. Enterprise search – how relevant is ‘relevance’? Martin White, Managing Director, Intranet Focus Ltd Visiting Professor, Information School, University of Sheffield Meetup London 23 June 2020 Martin.white@intranetfocus.com @intranetfocus @Intranetfocus 1
  • 2. My search history 1975 – 2020 (and beyond..!) 2 1975 1999 2002 - 2020 DECO Unilever Computing Services 1982 http://intranetfocus.com/resources/reports/
  • 4. What do we mean by search? • WWW search • Massive volumes of everything dominated by Google and the need for advertising revenue • Extensive use of SEO curation of content to improve findability • Inadvertently sets the ‘benchmark’ for enterprise search “Why can’t our search be like Google?’ • Web site/intranet search • Highly curated content with a focus on browsing • Search usually ends up as an afterthought • Significant resources devoted to ‘being found’ • Repository search • Services for (primarily) academic users with highly curated content on a library/archive management system • Also extensive use of external search applications such as Scopus and Lexis • Precision is important • E-Commerce search • Highly curated content and very good user tracking metrics • Very clear RoI based on sales revenues and repeat customers @Intranetfocus 4
  • 5. Enterprise search • Massive amounts of structured and unstructured content, very little of it curated and dominated by Microsoft 365 Search in terms of installed base and ‘best practice’ • Often multiple applications in a federated search implementation • Multiple content languages and unpredictable language skills of users • Security trimming plays havoc with relevance • Users are experts in their domain, and have a substantial about of information pushed to them by other enterprise applications (ERP, CRM, HR) and of course email and social media • Search is therefore ‘additive’ and not from a zero knowledge base • Not based around business processes so it is very difficult to assess whether a search failure is a content issue, a query issue or a technology issue • Rarely is there any internal content/user focused team. Largely managed by IT on ‘technical performance’ (uptime/traffic) metrics • Assumption is that search is intuitive, so no training and no mentoring available @Intranetfocus 5
  • 7. Classic IR/IIR evaluation assumptions • Access to a representative test collection • Selection of a representative user cohort (all too often students!) • Fluency in query and content languages • No (or little) prior subject knowledge assumed of searchers • No (or little) justification for undertaking the search • No business imperative (or buy in) to find the information • A/B tests used on a test collection to assess ranking improvement • Relevance assessed (usually) on item title and snippet • No assessment of content quality @Intranetfocus 7
  • 8. Information seeking – some numbers @Intranetfocus 8 0 10 20 30 40 50 60 70 80 90 Information service Electronic closing binders Internal specialists Enterprise search Library books Knowledge database Library staff Electronic research services DM applications Intranet Global law firm – approx. 1000 respondents out of 2000 staff Intranet Focus client % using each option on a ‘very frequent’ or ‘frequent’ basis
  • 10. The state of the art @Intranetfocus 10 © 2020 IntraTeam Digital Workplace Benchmarking Service www.intrateam.com Findwise Enterprise Findability Survey 2018 https://findwise.com/en/enterprise-search
  • 11. Impact of search tasks on search performance @Intranetfocus 11 The qualitative content analysis of work-task journal study material pointed on the following work task scenarios: (1) ordinary and (2) unordinary administrative tasks; everyday professional tasks as (3) high-quality tasks, (4) “just-to-get- done” tasks and (5) regular teamwork; and unordinary professional tasks as (6) unique tasks and (7) inventive teamwork The impression we have, is that the majority of current IIR research centres on Internet searching and everyday- life information needs. ….However, there remains a need for IIR research on information searching in relation to information intensive work task performance with respect to optimise information searching, the various platforms used for information searching, and understanding of the conditions under which work task performance takes place. https://drive.google.com/file/d/1BosRT0sDMvVjPxLCR4dpxuYT9ip-FSWW/view 2019 Strix Lecture given by Pia Borlund on search and task completion
  • 12. Foraging for information @Intranetfocus 12 Information patch Information scent Information diet
  • 13. Complex Searcher Model Information Scent, Searching and Stopping. Modelling SERP Level Stopping Behaviour David Maxwell and Leif Azzopardi https://www.cmswire.com/information-management/enterprise-search-development-start- with-the-user-interface/
  • 14. Stopping strategies @Intranetfocus 14 Increase in user effort Increase in search success Stopping point "I was expecting to see Document X on the first couple of pages. It’s not there/ How can I trust the search application?" "Five irrelevant results one after another. I'll give John a call. He'll know the answer" "I'm on p3 of the results and I've only seen one decent result and that's a document I already have" "I've found the French version of the report. Where is the English master version? I'm wasting my time"
  • 15. Professional search – use cases are everything @Intranetfocus 15 Information Management and Processing 5 (2018) 1042-1057 https://strathprints.strath.ac.uk/64935/ (open access version) Synonyms Boolean Abbreviations HealthcareLegal
  • 16. Looking at click logs @Intranetfocus 16 72,000 employees From #150 – #500 average queries per month 200 box 83338 3 BOX 15526 26 Box 7276 76 106,140 concur 50577 5 Concur 8100 67 concur 11914 38 70,591 hr 15541 25 HR 9929 46 25,470 lti 9197 53 LTI 9101 56 18,298 my hr 22011 15 myhr 11233 43 33,244 ocm 19446 18 OCM 7407 74 26,853 saba 22667 14 saba cloud 13700 32 36,367 software 12715 35 Software 9455 49 software store 14453 29 36,623 workday 84380 2 Workday 8981 57 93,361 it 8287 65 15,776 IT 7489 70 kiosk 14120 30 Kiosk 6858 81 20,978 angel 36133 9 ANGEL 8557 61 44,690
  • 17. Three use cases @Intranetfocus 17 Use Case A Quick access route to applications, people, office locations, breaking news etc Often ‘clever’ single query terms Use Case B Checking on corporate policies to find the most recent version, Corporate/divisional news archive Products, initiatives and projects Query terms include date ranges/alphanumeric terms Use Case C Topic-related searches using well-considered query terms Often periodic (i.e. monthly) or cyclic (‘end-of-year’) searches Sum of Top 100 Searches 1,563,384 Sum of Next 400 1,389,782 ‘Precision’ ‘Recall’
  • 18. How relevant is relevance? • Enterprise search as users will not click through each result in sequence as they are using a range of clues to work out what is relevant. • Ascertaining ‘intent’ is very difficult as people have multiple roles, multiple use- cases and unequal and unknown prior knowledge • It could be looking for an author that is in the same building so that they can go at talk to them about a problem they have. The document search was actually to find a local expert. The document may otherwise have been ‘irrelevant’ • The user may spot a very relevant document at say #3 but will not click on it because they already have it. The presence of the document at #3 is a reassurance that they are on the right track. • These behaviours make it very challenging to work out what set of heuristics the user is employing to assess each result and so what is ‘relevant’ @Intranetfocus 18
  • 19. Evaluating enterprise search 1. Click logs have a value, but for every query term ask “Why?” 2. “Time well spent” http://intranetfocus.com/time-well-spent-a-potential-holistic-view-of-productivity/ 3. Computational ethnography using data logging 4. Constant usability testing around tasks and task completion 5. Repetition (also constant) of high hit and low/zero hit searches 6. Embedded search mentors, especially in project teams 7. Clever surveys 8. Story telling – “how was it for you?” 9. Spot checks 10. Heuristic benchmarking (SearchCheck http://intranetfocus.com/enterprise-search-consulting-services/) @Intranetfocus 19
  • 20. Summary • The primary search mode in the enterprise is information foraging, where users apply a very personal array of heuristics to assessing relevance • ‘Relevance’ is still relevant but has to be associated with specific use cases and with a range of other search and query measurements • Content quality, snippet quality and the extent of search expertise play very important roles in search success and search satisfaction • Security trimming can have a significant impact on ranking which is impossible to allow for • The complexity of enterprise search implementations is why having a skilled multi-disciplinary search team is essential @Intranetfocus 20
  • 22. Further reading @Intranetfocus 22 Enterprise Search (O’Reilly Media) Achieving Enterprise Search Satisfaction http://intranetfocus.com/resources/reports/ Recent columns Seven stress tests for your enterprise search and intranet search applications Distributed information management – the oxygen of your organisation Unpacking the complexities of enterprise search behaviour Search won’t improve until we understand why people search, not just how Diving into enterprise search query logs When improving search performance don’t follow the clicks Time spent searching – a chronology of the myth and some recent research A list of twenty enterprise search myths http://theses.gla.ac.uk/41132/