Presentation delivered at SPNHC-TDWG 2018 in Dunedin, New Zealand. Covers new and upcoming developments in the search functionality of the Biodiversity Heritage Library, an open access digital library.
(DIYA) Call Girls Sinhagad Road ( 7001035870 ) HI-Fi Pune Escorts Service
SPNHCTDWG2018_Improving Search Efficiency in BHL
1. Improving Search Efficiency
in the Biodiversity Heritage
Library corpus
Carolyn Sheffield
@BHLProgramMgr
28 August | SPNHC-TDWG 2018
Share your thoughts on social media using
#BHLib
3. Pam McClanahan
Marissa Kings
Alicia Esquivel
Katie Mika
Ariadne Rehbein
• Full Text Search
• Transcriptions and OCR
corrections
• Named entities enhancements
• Annotations
• Linking
• Overall usability
• Many more!
Priorities for improving research efficiency:
Recommendations from National Digital Stewardship Residents
4. Priorities for improving research efficiency:
Recommendations from National Digital Stewardship Residents
• Full Text Search
• Transcriptions and OCR
corrections
• Named entities enhancements
• Annotations
• Linking
• Overall usability
• Many more!
Pam McClanahan
Marissa Kings
Alicia Esquivel
Katie Mika
Ariadne Rehbein
5. New Service: Full Text Search!
Search across the text of all 54+ million pages in BHL!
Search results now display matches within both the
bibliographic information + the full text
Filter search results by content type, publication date,
subject, language, and author with new faceted browsing
Use “search inside” to search for terms within a
book you are viewing
15. • Results only as good as OCR
• And faceting only as good as metadata
• Ambiguity of identifiers mentioned in
texts
• API v.3 – Coming Soon!
Things to Keep in Mind
16. Priorities for improving research efficiency:
Recommendations from National Digital Stewardship Residents
• Full Text Search
• Transcriptions and OCR
corrections
• Named entities enhancements
• Annotations
• Linking
• Overall usability
• Many more!
Pam McClanahan
Marissa Kings
Alicia Esquivel
Katie Mika
Ariadne Rehbein
17. LEADS
• 10 weeks
• Identify place names in texts
• Associate with mentions of species
names or type specimens
• Disambiguate geographic name strings
and associate them with a point on a map
LEADS-4-NDP
LIS Education and Data Science for the National Digital Platform
Gretchen R. Stahlman
18. LEADS - Challenges
Challenges
• OCR
• Ambiguity of references
• Diverse corpus
LEADS-4-NDP
LIS Education and Data Science for the National Digital Platform
22. Methods
• Annotations
• Machine learning
• Visualization tools
LEADS-4-NDP
LIS Education and Data Science for the National Digital Platform
23. Priorities for improving research efficiency:
Recommendations from National Digital Stewardship Residents
• Full Text Search
• Transcriptions and OCR
corrections
• Named entities enhancements
• Annotations
• Linking
• Overall usability
• Many more!
Pam McClanahan
Marissa Kings
Alicia Esquivel
Katie Mika
Ariadne Rehbein