4. Agenda
• Enterprise Search Portal
• Insight into SP2013 Search
• Key changes from SP2010
• A bit of magic – relevancy calculation
• Search governance, useful hint & tips
4
5. Key search patterns
• I know what I’m searching and where to find it
• I know what I’m searching but don’t know where
to find it.
• I don’t‘ know what I’m searching
5
http://aghy.hu/AghyBlog_EN/Lists/Posts/Post.aspx?ID=199
6. • Demand:
• Fast growing enterprises
• Zoo of internal systems
• Solution:
• “google” inside enterprise
• Quick-wins for business:
• Single point of smart search and information retrieval
• Reduce search time by employee
• Better inner communications and simplified reuse of
conent
6
Enterprise Search Portal
7. But after deployment…
• «.. Search sucks»
• Out of the box search knows nothing about you
• «Typical But…
• … Microsoft takes care of decent search algorithm»
• … we’re not sure we can do better»
• ... we don’t need search, everybody know where content is»
• … make our search like in facebook/google/bing (instead of
requirements)»
7
8. Why it’s hard
• Ambiguous short queries
• Unstructured not optimized content
• Different active vocabulary of content users and
creators
• Limited resources ($), while in internet search:
• Auto and manual testing of search quality (assessors)
• Continuous improvement
8
10. Search in two phase
process
• Matching – all docs with keywords
• Linguistics: stemming, phonetics
• Synonyms
• Ranking
• «Фичи»
• TF-IDF, BM25
• Вес полей
• Тип файла
• Дата изменения
• Популярность
• …
10
15. Ranking in SP2013
• Default Relevancy Model
• Two neural networks
• Freshness in not included in ranking
• Features
15
Type Instance
BM25 BM25
Static UrlDepth
BucketedStatic InternalFileType
BucketedStatic Language
Static ClickDistance
Static QueryLogClicks
Static QueryLogSkips
Static LastClicks
Static EventRate
MinSpan - soft Title
MinSpan - soft Title
MinSpan - soft Title
MinSpan - soft Content
24. 2. Fine tuning
• Authoritative Pages
• Quick win – content source priority
• Query Rules
• Smart search for users
• Synonyms
• Separate mapping file
• Expansion only
• Termsets synonyms NOT working
• Relevancy models
24
25. Authoritative Pages
• Impacts ClickDistance
• ClickDistance, UrlDepth have hich impact on total
score (see explain rank)
• Configures in CA, CSOM
25
26. Query Rules (Rule +
Action)
• The tool to make search smarter
• Interactive feedback to user queries
• Post processing of queries
• Leverage navigational queries
• …
26
27. Condition for Query Rules
• Query Matches Keyword Exactly
• Advanced Query Text Match
• Query Matches Dictionary Exactly
• Query Contains Action Term
• Query More Common in Source
• Result Type Commonly Clicked
27
28. Actions для Query Rules
• Create and display a result block
• Change ranked search results
• Best Bets
• XRANK
• Works additive to total rank
• Not explained in rankdetail
• How to choose correct value?
28
29. Templates for
QueryRules
• Typical navigational keywords from our portal
• Software, soft, download, install
• How to
• Policy, Blog
• Portal
• Music, Video
• Presentation, Documents, Report
• Training, tutorial
• Book, ebook
• You will have different ones!
29
34. 4. Security «audit»
• Search reveals breaches in security
• Security by obscurity
• Examples of queries:
• «confidential»
• Salaries, performance reviews
• Solution – automatic monitoring of sensitive
queries
34
35. 5. Adoption of content
• Use with departments
• Get help with search monitoring of their queries
• Guideline to format content
• Basic SEO
• Titles
• Friendly urls
• Custom meta tags <meta name=…
• Title, description
• Custom Automatically appear in crawled properties
35
36. 6. Promotion within
company
• Image – «you will find everything here»
• Integrate with other portals
• Propose Search as a serivce
• Widget «Global search»
• Badges, gamification
36
38. Semantic search
• Cannot be solved in general
• Analytics + fine tuning
• See practices above
• NLP – question answering
• Rocket science
• English only
• Part of speech tagging, dependency parsing
• Stanford NLP, Open NLP, IR
38
39. «References»
• Patents - http://goo.gl/20sbR
• Explain Rank page - http://goo.gl/o3ZmN
• How SP2013 relevancy models works - http://goo.gl/arf0P
• MS Enterprise Search approach - http://goo.gl/x8SDO
• Customizing ranking models in SP 2013 - http://goo.gl/lBJAp
39