SlideShare a Scribd company logo
1 of 32
Download to read offline
Vector Databases and
Neural Search
Dmitry Kan, Max Irwin
Open NLP Meetup
About me
● PhD in NLP
● 16+ years of experience in developing search engines
for start-ups and multinational technology giants
● Expert in vector search engines and the host of the
Vector Podcast
● Blogging about vector search on Medium:
https://dmitry-kan.medium.com/
● Committer on search QA tool Quepid:
https://github.com/o19s/quepid
Team Sisu: BigANN Competition @ NeurIPS’21
● Max Irwin
● Alex Semenov
● Aarne Talman
● Leo Joffe
● Alex Klibisz
● Dmitry Kan
Billion-Scale ANN Algorithm Challenge
Scope
● Vector search in a nutshell
● Main Vector DBs
● Neural search frameworks
● Get practical
● Demos
Google Trends
Vector search in a nutshell
Vector search is a way to represent and search
your objects (documents, songs, images..) in a
geometric space (usually of high-dimension) in
the form of an embedding (a vector of
numbers: [0.9, -0.1, 0.15, …])
● At small scale you can apply exact KNN
search
● At larger scale you need to use ANN
search: trade some precision for speed
Credit: Weaviate V1.0 release - virtual meetup
Use cases
● Semantic search
● Image similarity
● Sound search
● Multimodality: searching images with text
● Recommenders
● E-commerce zero-hit long-tail (similarity search)
Big players in the game
● Spotify: ANNOY
● Microsoft (Bing team): Zoom, DiskANN, SPTAG
○ Azure Cognitive Search
● Amazon: KNN based on HNSW in OpenSearch
● Google: ScaNN
● Yahoo! Japan: NGT
● Facebook: FAISS, PQ (CPU & GPU)
● Baidu: IPDG (Baidu Cloud)
● Yandex
● NVidia
● Intell
ANN algorithms
https://bit.ly/3ApqYYQ
BuddyPQ
increases 12%
in Recall over
baseline
FAISS
Credit: The Pinecone Vector Database System (Edo Liberty)
BuddyPQ: improving over FAISS
BIGANN dataset Kolmogorov-Smirnov dimension test matrix for the first
100000 points. A higher number indicates a less similar distribution
Variance Inflation Factor (Multicollinearity) (~2.25)
Smaller Vector DB players: 71% are Open Source
Company Product Cloud Open Source:
Y/N
Algorithms
SeMI Weaviate Y Y (Go) custom HNSW
Pinecone Pinecone Y N FAISS + own
GSI APU chip for
Elasticsearch /
Opensearch
N N Neural hashing
/ Hamming
distance
Qdrant Qdrant N Y (Rust) HNSW (graph)
Yahoo! Vespa Y Y (Java, C++) HNSW (graph)
Ziliz Milvus N Y (Go, C++,
Python)
FAISS, HNSW
Yahoo! Vald N Y (Go) NGT
Milvus
🌍 milvus.io
💡 self-hosted vector database
🤖 open source
Value proposition:
● attention to scalability of the
entire search engine: (re)indexing
and search
● ability to index data with multiple
ANN algorithms to compare their
performance for your use case
Pinecone
🌍 pinecone.io
💡managed vector database
🤖 close source
Value proposition:
● Fully managed vector
database
● Single-stage filtering
capability: search for your
objects (sweaters) + filter by
metadata (color, size, price) in
one query
Vespa
🌍 vespa.ai/
💡 managed / self-hosted
🤖 Code: open source
Value proposition:
● low-latency computation over large
data sets
● stores and indexes your data so that
queries, selection and processing
over the data can be performed at
serving time
● customizable functionality
● deep data structures geared towards
deep-learning like data science, like
Tensors
Weaviate
🌍
semi.technology/developers/weavia
te/current/
💡 managed / self-hosted
🤖 open source
Value proposition:
● Expressive query syntax
● Graphql-like interface
● combo of vector search, object
storage and inverted index
● Wow-effect: Has an impressive
question answering
component — esp for demos
Vald
🌍 Link: vald.vdaas.org/
💡 Type: Self-hosted vector
database
🤖 Code: open source
Value proposition:
● Billion-scale
● Cloud-native architecture
● Fastest ANN Algo: NGT
● Custom reranking / filtering
algorithm plugins
GSI APU
🌍 Link: gsitechnology.com/APU
💡 Type: Vector search hardware backend
for your Elasticsearch / OpenSearch
🤖 Code: close source
Value proposition:
● Billion-scale
● Extends your Elasticsearch /
OpenSearch capabilities to similarity
search
● On-prem / hosted APU board hosted
cloud backend
Qdrant
🌍 qdrant.tech/
💡 self-hosted vector database (Cloud in
roadmap)
🤖 open source
Value proposition:
● The vector similarity engine with
extended filtering support
● dynamic query planning and payload
data indexing
● string matching, numerical ranges,
geo-locations, and more
● Metric Deep Learning
Semantic frameworks / layers: 57% Open Source
Company Product Open Source: Y/N Focus
Deepset.ai Haystack Y NLP, neural search
Jina.AI Jina, Hub, Finetuner Y NLP, CV, ASR
Featureform Feature store,
EmbeddingHub
Y All AI verticals
ZIR.AI AI search platform N NLP
Hebbia.AI Knowledge Base N NLP -> Finance
Rasa.ai Virtual assistants Y NLP
Muves.io Multilingual vector
search
N Multilingual search,
multimodality
Vector Databases: Milvus, Weaviate, Pinecone, GSI, Qdrant, Vespa, Vald, Elastiknn
Neural frameworks: Haystack, Jina.AI, ZIR.AI, Muves, Hebbia.AI, Featureform
KNN / ANN algorithms: HNSW, PQ, IVF, LSH, Zoom, DiskANN, BuddyPQ, …
Multi-modal encoder / single modality encoders
Application business logic: neural / BM25, symbolic
filters, ranking
user interface
How to pick a vector DB / framework
● Have own engineering?
○ Yes: go for the framework vendor / self-hosted DB
○ No: choose higher-level system, like Hebbia.AI
● Own embedding layer or OK with vector DB doing it?
○ Own: Qdrant, Milvus, Pinecone, GSI, Vespa, Vald
○ In-DB: Weaviate, Vespa
● Heavy focus on NLP?
○ YES: Consider Haystack (deepset)
○ NO: Consider Jina.AI
● Want to quickly test before investing?
○ Yes: ZIR.AI, Hebbia.AI
○ No: Jina.AI, Haystack etc
How to pick a vector DB / framework
● Want to HOST or fine with MANAGED?
○ HOST: Vespa, Vald, Milvus, Qdrant
○ MANAGED: Pinecone, Weaviate, GSI, Hebbia.AI, ZIR.AI
Demo: Muves demo.muves.io
Trends in ML at large
● Model hubs (e.g. Hugging Face) → ML community shares progress
quickly (similar to what GitHub did to sharing code)
● Deep Learning → multimodal: CLIP (text from images), DALL-E
(images from text)
● MLOps optimize experimentation and deployments: determined.ai,
DVC, MLflow / Kubeflow
Get practical
● Code: https://github.com/DmitryKey/bert-solr-search
● Supported Engines: Solr, Elasticsearch, OpenSearch, GSI, [hnswlib]
● Supported LMs: BERT, SBERT, [theoretically any]
Q&A demo
Questions:
1. Why did Peloton shares fall?
2. How are articles created on TechCrunch?
3. What is ethical AI?
https://techcrunch.vectors.network/
Weaviate console
Thank you! 🧡
twitter.com/DmitryKan
youtube.com/c/VectorPodcast
https://spoti.fi/3sRXcdn
Links
1. Martin Fowler’s talk on NoSQL databases: https://www.youtube.com/watch?v=qI_g07C_Q5I
2. Neural search vs Inverted search
https://trends.google.com/trends/explore?date=all&q=neural%20search,inverted%20search
3. Not All Vector Databases Are Made Equal
https://towardsdatascience.com/milvus-pinecone-vespa-weaviate-vald-gsi-what-unites-these-
buzz-words-and-what-makes-each-9c65a3bd0696
4. HN thread: https://news.ycombinator.com/item?id=28727816
5. A survey of PQ and related methods: https://faiss.ai/
6. Vector Podcast on YouTube:
https://www.youtube.com/channel/UCCIMPfR7TXyDvlDRXjVhP1g
7. Vector Podcast on Spotify: https://open.spotify.com/show/13JO3vhMf7nAqcpvlIgOY6
8. Vector Podcast on Apple Podcasts:
https://podcasts.apple.com/us/podcast/vector-podcast/id1587568733
9. BERT, Solr, Elasticsearch, OpenSearch, HNSWlib – in Python:
https://github.com/DmitryKey/bert-solr-search

More Related Content

What's hot

How Kafka Powers the World's Most Popular Vector Database System with Charles...
How Kafka Powers the World's Most Popular Vector Database System with Charles...How Kafka Powers the World's Most Popular Vector Database System with Charles...
How Kafka Powers the World's Most Popular Vector Database System with Charles...HostedbyConfluent
 
Apache Superset - open source data exploration and visualization (Conclusion ...
Apache Superset - open source data exploration and visualization (Conclusion ...Apache Superset - open source data exploration and visualization (Conclusion ...
Apache Superset - open source data exploration and visualization (Conclusion ...Lucas Jellema
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptxAlex Ivy
 
RESTful Machine Learning with Flask and TensorFlow Serving - Carlo Mazzaferro
RESTful Machine Learning with Flask and TensorFlow Serving - Carlo MazzaferroRESTful Machine Learning with Flask and TensorFlow Serving - Carlo Mazzaferro
RESTful Machine Learning with Flask and TensorFlow Serving - Carlo MazzaferroPyData
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks FundamentalsDalibor Wijas
 
Bringing ML To Production, What Is Missing? AMLD 2020
Bringing ML To Production, What Is Missing? AMLD 2020Bringing ML To Production, What Is Missing? AMLD 2020
Bringing ML To Production, What Is Missing? AMLD 2020Mikio L. Braun
 
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and PluginsMonitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and PluginsDatabricks
 
Unified Big Data Processing with Apache Spark (QCON 2014)
Unified Big Data Processing with Apache Spark (QCON 2014)Unified Big Data Processing with Apache Spark (QCON 2014)
Unified Big Data Processing with Apache Spark (QCON 2014)Databricks
 
Apply MLOps at Scale by H&M
Apply MLOps at Scale by H&MApply MLOps at Scale by H&M
Apply MLOps at Scale by H&MDatabricks
 
Simplify and Scale Data Engineering Pipelines with Delta Lake
Simplify and Scale Data Engineering Pipelines with Delta LakeSimplify and Scale Data Engineering Pipelines with Delta Lake
Simplify and Scale Data Engineering Pipelines with Delta LakeDatabricks
 
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...Ed Fernandez
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsRoelof Pieters
 
Neural Search Comes to Apache Solr
Neural Search Comes to Apache SolrNeural Search Comes to Apache Solr
Neural Search Comes to Apache SolrSease
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph IntroductionSören Auer
 
Apache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversApache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversScyllaDB
 
bm25 demystified
bm25 demystifiedbm25 demystified
bm25 demystifiedFan Robbin
 
[pgday.Seoul 2022] PostgreSQL with Google Cloud
[pgday.Seoul 2022] PostgreSQL with Google Cloud[pgday.Seoul 2022] PostgreSQL with Google Cloud
[pgday.Seoul 2022] PostgreSQL with Google CloudPgDay.Seoul
 
Considerations for Data Access in the Lakehouse
Considerations for Data Access in the LakehouseConsiderations for Data Access in the Lakehouse
Considerations for Data Access in the LakehouseDatabricks
 

What's hot (20)

How Kafka Powers the World's Most Popular Vector Database System with Charles...
How Kafka Powers the World's Most Popular Vector Database System with Charles...How Kafka Powers the World's Most Popular Vector Database System with Charles...
How Kafka Powers the World's Most Popular Vector Database System with Charles...
 
Apache Superset - open source data exploration and visualization (Conclusion ...
Apache Superset - open source data exploration and visualization (Conclusion ...Apache Superset - open source data exploration and visualization (Conclusion ...
Apache Superset - open source data exploration and visualization (Conclusion ...
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptx
 
RESTful Machine Learning with Flask and TensorFlow Serving - Carlo Mazzaferro
RESTful Machine Learning with Flask and TensorFlow Serving - Carlo MazzaferroRESTful Machine Learning with Flask and TensorFlow Serving - Carlo Mazzaferro
RESTful Machine Learning with Flask and TensorFlow Serving - Carlo Mazzaferro
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks Fundamentals
 
Bringing ML To Production, What Is Missing? AMLD 2020
Bringing ML To Production, What Is Missing? AMLD 2020Bringing ML To Production, What Is Missing? AMLD 2020
Bringing ML To Production, What Is Missing? AMLD 2020
 
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and PluginsMonitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins
 
Elasticsearch
ElasticsearchElasticsearch
Elasticsearch
 
Unified Big Data Processing with Apache Spark (QCON 2014)
Unified Big Data Processing with Apache Spark (QCON 2014)Unified Big Data Processing with Apache Spark (QCON 2014)
Unified Big Data Processing with Apache Spark (QCON 2014)
 
What is MLOps
What is MLOpsWhat is MLOps
What is MLOps
 
Apply MLOps at Scale by H&M
Apply MLOps at Scale by H&MApply MLOps at Scale by H&M
Apply MLOps at Scale by H&M
 
Simplify and Scale Data Engineering Pipelines with Delta Lake
Simplify and Scale Data Engineering Pipelines with Delta LakeSimplify and Scale Data Engineering Pipelines with Delta Lake
Simplify and Scale Data Engineering Pipelines with Delta Lake
 
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
Machine Learning Platformization & AutoML: Adopting ML at Scale in the Enterp...
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word Embeddings
 
Neural Search Comes to Apache Solr
Neural Search Comes to Apache SolrNeural Search Comes to Apache Solr
Neural Search Comes to Apache Solr
 
Knowledge Graph Introduction
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph Introduction
 
Apache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversApache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the Covers
 
bm25 demystified
bm25 demystifiedbm25 demystified
bm25 demystified
 
[pgday.Seoul 2022] PostgreSQL with Google Cloud
[pgday.Seoul 2022] PostgreSQL with Google Cloud[pgday.Seoul 2022] PostgreSQL with Google Cloud
[pgday.Seoul 2022] PostgreSQL with Google Cloud
 
Considerations for Data Access in the Lakehouse
Considerations for Data Access in the LakehouseConsiderations for Data Access in the Lakehouse
Considerations for Data Access in the Lakehouse
 

Similar to Vector databases and neural search

The Big Data Puzzle, Where Does the Eclipse Piece Fit?
The Big Data Puzzle, Where Does the Eclipse Piece Fit?The Big Data Puzzle, Where Does the Eclipse Piece Fit?
The Big Data Puzzle, Where Does the Eclipse Piece Fit?J Langley
 
Machine Learning on Google Cloud with H2O
Machine Learning on Google Cloud with H2OMachine Learning on Google Cloud with H2O
Machine Learning on Google Cloud with H2OSri Ambati
 
London IR Meetup - Players in Vector Search_ algorithms, software and use cases
London IR Meetup - Players in Vector Search_ algorithms, software and use casesLondon IR Meetup - Players in Vector Search_ algorithms, software and use cases
London IR Meetup - Players in Vector Search_ algorithms, software and use casesDmitry Kan
 
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習 Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習 Herman Wu
 
Big Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI ProsBig Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI ProsAndrew Brust
 
Big Data Analytics (ML, DL, AI) hands-on
Big Data Analytics (ML, DL, AI) hands-onBig Data Analytics (ML, DL, AI) hands-on
Big Data Analytics (ML, DL, AI) hands-onDony Riyanto
 
Big Data Trend and Open Data
Big Data Trend and Open DataBig Data Trend and Open Data
Big Data Trend and Open DataJongwook Woo
 
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02BIWUG
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointJoris Poelmans
 
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...Greg Makowski
 
Quick dive into the big data pool without drowning - Demi Ben-Ari @ Panorays
Quick dive into the big data pool without drowning - Demi Ben-Ari @ PanoraysQuick dive into the big data pool without drowning - Demi Ben-Ari @ Panorays
Quick dive into the big data pool without drowning - Demi Ben-Ari @ PanoraysDemi Ben-Ari
 
Graph-Oriented NoSQL Databases
Graph-Oriented NoSQL  Databases Graph-Oriented NoSQL  Databases
Graph-Oriented NoSQL Databases Abdelkader OUARED
 
Big data serving: Processing and inference at scale in real time
Big data serving: Processing and inference at scale in real timeBig data serving: Processing and inference at scale in real time
Big data serving: Processing and inference at scale in real timeItai Yaffe
 
Latest Developments in H2O
Latest Developments in H2OLatest Developments in H2O
Latest Developments in H2OSri Ambati
 
Bringing Deep Learning into production
Bringing Deep Learning into production Bringing Deep Learning into production
Bringing Deep Learning into production Paolo Platter
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital.AI
 
Info Session : University Institute of engineering and technology , Kurukshet...
Info Session : University Institute of engineering and technology , Kurukshet...Info Session : University Institute of engineering and technology , Kurukshet...
Info Session : University Institute of engineering and technology , Kurukshet...HRITIKKHURANA1
 
Designing Artificial Intelligence
Designing Artificial IntelligenceDesigning Artificial Intelligence
Designing Artificial IntelligenceDavid Chou
 

Similar to Vector databases and neural search (20)

The Big Data Puzzle, Where Does the Eclipse Piece Fit?
The Big Data Puzzle, Where Does the Eclipse Piece Fit?The Big Data Puzzle, Where Does the Eclipse Piece Fit?
The Big Data Puzzle, Where Does the Eclipse Piece Fit?
 
Machine Learning on Google Cloud with H2O
Machine Learning on Google Cloud with H2OMachine Learning on Google Cloud with H2O
Machine Learning on Google Cloud with H2O
 
London IR Meetup - Players in Vector Search_ algorithms, software and use cases
London IR Meetup - Players in Vector Search_ algorithms, software and use casesLondon IR Meetup - Players in Vector Search_ algorithms, software and use cases
London IR Meetup - Players in Vector Search_ algorithms, software and use cases
 
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習 Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
 
Big Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI ProsBig Data and NoSQL for Database and BI Pros
Big Data and NoSQL for Database and BI Pros
 
Big Data Analytics (ML, DL, AI) hands-on
Big Data Analytics (ML, DL, AI) hands-onBig Data Analytics (ML, DL, AI) hands-on
Big Data Analytics (ML, DL, AI) hands-on
 
Big Data Trend and Open Data
Big Data Trend and Open DataBig Data Trend and Open Data
Big Data Trend and Open Data
 
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePoint
 
AI on Big Data
AI on Big DataAI on Big Data
AI on Big Data
 
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
 
Quick dive into the big data pool without drowning - Demi Ben-Ari @ Panorays
Quick dive into the big data pool without drowning - Demi Ben-Ari @ PanoraysQuick dive into the big data pool without drowning - Demi Ben-Ari @ Panorays
Quick dive into the big data pool without drowning - Demi Ben-Ari @ Panorays
 
Graph-Oriented NoSQL Databases
Graph-Oriented NoSQL  Databases Graph-Oriented NoSQL  Databases
Graph-Oriented NoSQL Databases
 
Big data serving: Processing and inference at scale in real time
Big data serving: Processing and inference at scale in real timeBig data serving: Processing and inference at scale in real time
Big data serving: Processing and inference at scale in real time
 
Latest Developments in H2O
Latest Developments in H2OLatest Developments in H2O
Latest Developments in H2O
 
Bringing Deep Learning into production
Bringing Deep Learning into production Bringing Deep Learning into production
Bringing Deep Learning into production
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Text mining and Visualizations
Text mining  and VisualizationsText mining  and Visualizations
Text mining and Visualizations
 
Info Session : University Institute of engineering and technology , Kurukshet...
Info Session : University Institute of engineering and technology , Kurukshet...Info Session : University Institute of engineering and technology , Kurukshet...
Info Session : University Institute of engineering and technology , Kurukshet...
 
Designing Artificial Intelligence
Designing Artificial IntelligenceDesigning Artificial Intelligence
Designing Artificial Intelligence
 

More from Dmitry Kan

Haystack LIVE! - 5 ways to increase result diversity at web-scale - Dmitry Ka...
Haystack LIVE! - 5 ways to increase result diversity at web-scale - Dmitry Ka...Haystack LIVE! - 5 ways to increase result diversity at web-scale - Dmitry Ka...
Haystack LIVE! - 5 ways to increase result diversity at web-scale - Dmitry Ka...Dmitry Kan
 
IR: Open source state
IR: Open source stateIR: Open source state
IR: Open source stateDmitry Kan
 
SentiScan: система автоматической разметки тональности в social media
SentiScan: система автоматической разметки тональности в social mediaSentiScan: система автоматической разметки тональности в social media
SentiScan: система автоматической разметки тональности в social mediaDmitry Kan
 
Social spam detection by SemanticAnalyzer Group
Social spam detection by SemanticAnalyzer GroupSocial spam detection by SemanticAnalyzer Group
Social spam detection by SemanticAnalyzer GroupDmitry Kan
 
Lucene revolution eu 2013 dublin writeup
Lucene revolution eu 2013 dublin writeupLucene revolution eu 2013 dublin writeup
Lucene revolution eu 2013 dublin writeupDmitry Kan
 
Starget sentiment analyzer for English
Starget sentiment analyzer for EnglishStarget sentiment analyzer for English
Starget sentiment analyzer for EnglishDmitry Kan
 
Linguistic component Tokenizer for the Russian language
Linguistic component Tokenizer for the Russian languageLinguistic component Tokenizer for the Russian language
Linguistic component Tokenizer for the Russian languageDmitry Kan
 
Linguistic component Lemmatizer for the Russian language
Linguistic component Lemmatizer for the Russian languageLinguistic component Lemmatizer for the Russian language
Linguistic component Lemmatizer for the Russian languageDmitry Kan
 
Linguistic component Sentiment Analyzer for the Russian language
Linguistic component Sentiment Analyzer for the Russian languageLinguistic component Sentiment Analyzer for the Russian language
Linguistic component Sentiment Analyzer for the Russian languageDmitry Kan
 
Solr onfitnesse learningfromberlinbuzzwords
Solr onfitnesse learningfromberlinbuzzwordsSolr onfitnesse learningfromberlinbuzzwords
Solr onfitnesse learningfromberlinbuzzwordsDmitry Kan
 
MTEngine: Semantic-level Crowdsourced Machine Translation
MTEngine: Semantic-level Crowdsourced Machine TranslationMTEngine: Semantic-level Crowdsourced Machine Translation
MTEngine: Semantic-level Crowdsourced Machine TranslationDmitry Kan
 
Rule based approach to sentiment analysis at romip’11 slides
Rule based approach to sentiment analysis at romip’11 slidesRule based approach to sentiment analysis at romip’11 slides
Rule based approach to sentiment analysis at romip’11 slidesDmitry Kan
 
Machine translation course program (in English)
Machine translation course program (in English)Machine translation course program (in English)
Machine translation course program (in English)Dmitry Kan
 
Rule based approach to sentiment analysis at ROMIP 2011
Rule based approach to sentiment analysis at ROMIP 2011Rule based approach to sentiment analysis at ROMIP 2011
Rule based approach to sentiment analysis at ROMIP 2011Dmitry Kan
 
Icsoft 2011 51_cr
Icsoft 2011 51_crIcsoft 2011 51_cr
Icsoft 2011 51_crDmitry Kan
 
Poster: Method for an automatic generation of a semantic-level contextual tra...
Poster: Method for an automatic generation of a semantic-level contextual tra...Poster: Method for an automatic generation of a semantic-level contextual tra...
Poster: Method for an automatic generation of a semantic-level contextual tra...Dmitry Kan
 
Semantic feature machine translation system
Semantic feature machine translation systemSemantic feature machine translation system
Semantic feature machine translation systemDmitry Kan
 
NoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache HadoopNoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache HadoopDmitry Kan
 
Introduction To Machine Translation 1
Introduction To Machine Translation 1Introduction To Machine Translation 1
Introduction To Machine Translation 1Dmitry Kan
 
Introduction To Machine Translation
Introduction To Machine TranslationIntroduction To Machine Translation
Introduction To Machine TranslationDmitry Kan
 

More from Dmitry Kan (20)

Haystack LIVE! - 5 ways to increase result diversity at web-scale - Dmitry Ka...
Haystack LIVE! - 5 ways to increase result diversity at web-scale - Dmitry Ka...Haystack LIVE! - 5 ways to increase result diversity at web-scale - Dmitry Ka...
Haystack LIVE! - 5 ways to increase result diversity at web-scale - Dmitry Ka...
 
IR: Open source state
IR: Open source stateIR: Open source state
IR: Open source state
 
SentiScan: система автоматической разметки тональности в social media
SentiScan: система автоматической разметки тональности в social mediaSentiScan: система автоматической разметки тональности в social media
SentiScan: система автоматической разметки тональности в social media
 
Social spam detection by SemanticAnalyzer Group
Social spam detection by SemanticAnalyzer GroupSocial spam detection by SemanticAnalyzer Group
Social spam detection by SemanticAnalyzer Group
 
Lucene revolution eu 2013 dublin writeup
Lucene revolution eu 2013 dublin writeupLucene revolution eu 2013 dublin writeup
Lucene revolution eu 2013 dublin writeup
 
Starget sentiment analyzer for English
Starget sentiment analyzer for EnglishStarget sentiment analyzer for English
Starget sentiment analyzer for English
 
Linguistic component Tokenizer for the Russian language
Linguistic component Tokenizer for the Russian languageLinguistic component Tokenizer for the Russian language
Linguistic component Tokenizer for the Russian language
 
Linguistic component Lemmatizer for the Russian language
Linguistic component Lemmatizer for the Russian languageLinguistic component Lemmatizer for the Russian language
Linguistic component Lemmatizer for the Russian language
 
Linguistic component Sentiment Analyzer for the Russian language
Linguistic component Sentiment Analyzer for the Russian languageLinguistic component Sentiment Analyzer for the Russian language
Linguistic component Sentiment Analyzer for the Russian language
 
Solr onfitnesse learningfromberlinbuzzwords
Solr onfitnesse learningfromberlinbuzzwordsSolr onfitnesse learningfromberlinbuzzwords
Solr onfitnesse learningfromberlinbuzzwords
 
MTEngine: Semantic-level Crowdsourced Machine Translation
MTEngine: Semantic-level Crowdsourced Machine TranslationMTEngine: Semantic-level Crowdsourced Machine Translation
MTEngine: Semantic-level Crowdsourced Machine Translation
 
Rule based approach to sentiment analysis at romip’11 slides
Rule based approach to sentiment analysis at romip’11 slidesRule based approach to sentiment analysis at romip’11 slides
Rule based approach to sentiment analysis at romip’11 slides
 
Machine translation course program (in English)
Machine translation course program (in English)Machine translation course program (in English)
Machine translation course program (in English)
 
Rule based approach to sentiment analysis at ROMIP 2011
Rule based approach to sentiment analysis at ROMIP 2011Rule based approach to sentiment analysis at ROMIP 2011
Rule based approach to sentiment analysis at ROMIP 2011
 
Icsoft 2011 51_cr
Icsoft 2011 51_crIcsoft 2011 51_cr
Icsoft 2011 51_cr
 
Poster: Method for an automatic generation of a semantic-level contextual tra...
Poster: Method for an automatic generation of a semantic-level contextual tra...Poster: Method for an automatic generation of a semantic-level contextual tra...
Poster: Method for an automatic generation of a semantic-level contextual tra...
 
Semantic feature machine translation system
Semantic feature machine translation systemSemantic feature machine translation system
Semantic feature machine translation system
 
NoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache HadoopNoSQL, Apache SOLR and Apache Hadoop
NoSQL, Apache SOLR and Apache Hadoop
 
Introduction To Machine Translation 1
Introduction To Machine Translation 1Introduction To Machine Translation 1
Introduction To Machine Translation 1
 
Introduction To Machine Translation
Introduction To Machine TranslationIntroduction To Machine Translation
Introduction To Machine Translation
 

Recently uploaded

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 

Recently uploaded (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 

Vector databases and neural search

  • 1. Vector Databases and Neural Search Dmitry Kan, Max Irwin Open NLP Meetup
  • 2. About me ● PhD in NLP ● 16+ years of experience in developing search engines for start-ups and multinational technology giants ● Expert in vector search engines and the host of the Vector Podcast ● Blogging about vector search on Medium: https://dmitry-kan.medium.com/ ● Committer on search QA tool Quepid: https://github.com/o19s/quepid
  • 3. Team Sisu: BigANN Competition @ NeurIPS’21 ● Max Irwin ● Alex Semenov ● Aarne Talman ● Leo Joffe ● Alex Klibisz ● Dmitry Kan Billion-Scale ANN Algorithm Challenge
  • 4. Scope ● Vector search in a nutshell ● Main Vector DBs ● Neural search frameworks ● Get practical ● Demos
  • 5.
  • 7. Vector search in a nutshell Vector search is a way to represent and search your objects (documents, songs, images..) in a geometric space (usually of high-dimension) in the form of an embedding (a vector of numbers: [0.9, -0.1, 0.15, …]) ● At small scale you can apply exact KNN search ● At larger scale you need to use ANN search: trade some precision for speed Credit: Weaviate V1.0 release - virtual meetup
  • 8. Use cases ● Semantic search ● Image similarity ● Sound search ● Multimodality: searching images with text ● Recommenders ● E-commerce zero-hit long-tail (similarity search)
  • 9. Big players in the game ● Spotify: ANNOY ● Microsoft (Bing team): Zoom, DiskANN, SPTAG ○ Azure Cognitive Search ● Amazon: KNN based on HNSW in OpenSearch ● Google: ScaNN ● Yahoo! Japan: NGT ● Facebook: FAISS, PQ (CPU & GPU) ● Baidu: IPDG (Baidu Cloud) ● Yandex ● NVidia ● Intell
  • 12. Credit: The Pinecone Vector Database System (Edo Liberty)
  • 13. BuddyPQ: improving over FAISS BIGANN dataset Kolmogorov-Smirnov dimension test matrix for the first 100000 points. A higher number indicates a less similar distribution Variance Inflation Factor (Multicollinearity) (~2.25)
  • 14.
  • 15. Smaller Vector DB players: 71% are Open Source Company Product Cloud Open Source: Y/N Algorithms SeMI Weaviate Y Y (Go) custom HNSW Pinecone Pinecone Y N FAISS + own GSI APU chip for Elasticsearch / Opensearch N N Neural hashing / Hamming distance Qdrant Qdrant N Y (Rust) HNSW (graph) Yahoo! Vespa Y Y (Java, C++) HNSW (graph) Ziliz Milvus N Y (Go, C++, Python) FAISS, HNSW Yahoo! Vald N Y (Go) NGT
  • 16. Milvus 🌍 milvus.io 💡 self-hosted vector database 🤖 open source Value proposition: ● attention to scalability of the entire search engine: (re)indexing and search ● ability to index data with multiple ANN algorithms to compare their performance for your use case
  • 17. Pinecone 🌍 pinecone.io 💡managed vector database 🤖 close source Value proposition: ● Fully managed vector database ● Single-stage filtering capability: search for your objects (sweaters) + filter by metadata (color, size, price) in one query
  • 18. Vespa 🌍 vespa.ai/ 💡 managed / self-hosted 🤖 Code: open source Value proposition: ● low-latency computation over large data sets ● stores and indexes your data so that queries, selection and processing over the data can be performed at serving time ● customizable functionality ● deep data structures geared towards deep-learning like data science, like Tensors
  • 19. Weaviate 🌍 semi.technology/developers/weavia te/current/ 💡 managed / self-hosted 🤖 open source Value proposition: ● Expressive query syntax ● Graphql-like interface ● combo of vector search, object storage and inverted index ● Wow-effect: Has an impressive question answering component — esp for demos
  • 20. Vald 🌍 Link: vald.vdaas.org/ 💡 Type: Self-hosted vector database 🤖 Code: open source Value proposition: ● Billion-scale ● Cloud-native architecture ● Fastest ANN Algo: NGT ● Custom reranking / filtering algorithm plugins
  • 21. GSI APU 🌍 Link: gsitechnology.com/APU 💡 Type: Vector search hardware backend for your Elasticsearch / OpenSearch 🤖 Code: close source Value proposition: ● Billion-scale ● Extends your Elasticsearch / OpenSearch capabilities to similarity search ● On-prem / hosted APU board hosted cloud backend
  • 22. Qdrant 🌍 qdrant.tech/ 💡 self-hosted vector database (Cloud in roadmap) 🤖 open source Value proposition: ● The vector similarity engine with extended filtering support ● dynamic query planning and payload data indexing ● string matching, numerical ranges, geo-locations, and more ● Metric Deep Learning
  • 23. Semantic frameworks / layers: 57% Open Source Company Product Open Source: Y/N Focus Deepset.ai Haystack Y NLP, neural search Jina.AI Jina, Hub, Finetuner Y NLP, CV, ASR Featureform Feature store, EmbeddingHub Y All AI verticals ZIR.AI AI search platform N NLP Hebbia.AI Knowledge Base N NLP -> Finance Rasa.ai Virtual assistants Y NLP Muves.io Multilingual vector search N Multilingual search, multimodality
  • 24. Vector Databases: Milvus, Weaviate, Pinecone, GSI, Qdrant, Vespa, Vald, Elastiknn Neural frameworks: Haystack, Jina.AI, ZIR.AI, Muves, Hebbia.AI, Featureform KNN / ANN algorithms: HNSW, PQ, IVF, LSH, Zoom, DiskANN, BuddyPQ, … Multi-modal encoder / single modality encoders Application business logic: neural / BM25, symbolic filters, ranking user interface
  • 25. How to pick a vector DB / framework ● Have own engineering? ○ Yes: go for the framework vendor / self-hosted DB ○ No: choose higher-level system, like Hebbia.AI ● Own embedding layer or OK with vector DB doing it? ○ Own: Qdrant, Milvus, Pinecone, GSI, Vespa, Vald ○ In-DB: Weaviate, Vespa ● Heavy focus on NLP? ○ YES: Consider Haystack (deepset) ○ NO: Consider Jina.AI ● Want to quickly test before investing? ○ Yes: ZIR.AI, Hebbia.AI ○ No: Jina.AI, Haystack etc
  • 26. How to pick a vector DB / framework ● Want to HOST or fine with MANAGED? ○ HOST: Vespa, Vald, Milvus, Qdrant ○ MANAGED: Pinecone, Weaviate, GSI, Hebbia.AI, ZIR.AI
  • 28. Trends in ML at large ● Model hubs (e.g. Hugging Face) → ML community shares progress quickly (similar to what GitHub did to sharing code) ● Deep Learning → multimodal: CLIP (text from images), DALL-E (images from text) ● MLOps optimize experimentation and deployments: determined.ai, DVC, MLflow / Kubeflow
  • 29. Get practical ● Code: https://github.com/DmitryKey/bert-solr-search ● Supported Engines: Solr, Elasticsearch, OpenSearch, GSI, [hnswlib] ● Supported LMs: BERT, SBERT, [theoretically any]
  • 30. Q&A demo Questions: 1. Why did Peloton shares fall? 2. How are articles created on TechCrunch? 3. What is ethical AI? https://techcrunch.vectors.network/ Weaviate console
  • 32. Links 1. Martin Fowler’s talk on NoSQL databases: https://www.youtube.com/watch?v=qI_g07C_Q5I 2. Neural search vs Inverted search https://trends.google.com/trends/explore?date=all&q=neural%20search,inverted%20search 3. Not All Vector Databases Are Made Equal https://towardsdatascience.com/milvus-pinecone-vespa-weaviate-vald-gsi-what-unites-these- buzz-words-and-what-makes-each-9c65a3bd0696 4. HN thread: https://news.ycombinator.com/item?id=28727816 5. A survey of PQ and related methods: https://faiss.ai/ 6. Vector Podcast on YouTube: https://www.youtube.com/channel/UCCIMPfR7TXyDvlDRXjVhP1g 7. Vector Podcast on Spotify: https://open.spotify.com/show/13JO3vhMf7nAqcpvlIgOY6 8. Vector Podcast on Apple Podcasts: https://podcasts.apple.com/us/podcast/vector-podcast/id1587568733 9. BERT, Solr, Elasticsearch, OpenSearch, HNSWlib – in Python: https://github.com/DmitryKey/bert-solr-search