Composite Indexes and the Faiss Index Factory
In the world of vector search, there are many indexing methods and vector processing techniques that allow us to prioritize between recall, latency, and memory usage.
Using specific methods such as IVF, PQ, or HNSW, we can often return good results. But for best performance we will usually want to use composite indexes.
We can view a composite index as a step-by-step process of vector transformations and one or more indexing methods. Allowing us to place multiple indexes and/or processing steps together to create our ‘ideal’ index.
For example, we can use an inverted file (IVF) index to reduce the scope of our search (increasing search speed), and then add a compression technique such as product quantization (PQ) to keep larger indexes within a reasonable size limit.
Where there is the ability to customize indexes, there is the risk of producing indexes with unnecessarily poor recall, latency, or memory usage.
We must know how composite indexes work if we want to build robust and high-performance vector similarity search applications. It is essential to understand where different indexes or vector transformations can be used — and when they are not needed.
Part 2: https://youtu.be/3Wqh4iUupbM
🌲 Pinecone article:
https://www.pinecone.io/learn/composite-indexes/
🤖 70% Discount on the NLP With Transformers in Python course:
https://bit.ly/3DFvvY5
🎉 Sign-up For New Articles Every Week on Medium!
https://jamescalam.medium.com/subscribe (it's free!)
https://medium.com/@jamescalam/membership
👾 Discord:
https://discord.gg/c5QtDB9RAP
00:00 Intro
01:54 Composite Indexes
06:43 Faiss Index Factory
11:34 Why we use Index Factory
17:11 Outro
Видео Composite Indexes and the Faiss Index Factory канала James Briggs
Using specific methods such as IVF, PQ, or HNSW, we can often return good results. But for best performance we will usually want to use composite indexes.
We can view a composite index as a step-by-step process of vector transformations and one or more indexing methods. Allowing us to place multiple indexes and/or processing steps together to create our ‘ideal’ index.
For example, we can use an inverted file (IVF) index to reduce the scope of our search (increasing search speed), and then add a compression technique such as product quantization (PQ) to keep larger indexes within a reasonable size limit.
Where there is the ability to customize indexes, there is the risk of producing indexes with unnecessarily poor recall, latency, or memory usage.
We must know how composite indexes work if we want to build robust and high-performance vector similarity search applications. It is essential to understand where different indexes or vector transformations can be used — and when they are not needed.
Part 2: https://youtu.be/3Wqh4iUupbM
🌲 Pinecone article:
https://www.pinecone.io/learn/composite-indexes/
🤖 70% Discount on the NLP With Transformers in Python course:
https://bit.ly/3DFvvY5
🎉 Sign-up For New Articles Every Week on Medium!
https://jamescalam.medium.com/subscribe (it's free!)
https://medium.com/@jamescalam/membership
👾 Discord:
https://discord.gg/c5QtDB9RAP
00:00 Intro
01:54 Composite Indexes
06:43 Faiss Index Factory
11:34 Why we use Index Factory
17:11 Outro
Видео Composite Indexes and the Faiss Index Factory канала James Briggs
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![Variables and Flows for Chatbots | NeMo Guardrails #2](https://i.ytimg.com/vi/3Vg5bEtaepQ/default.jpg)
![NEW Pinecone Assistant](https://i.ytimg.com/vi/MPCiNA8BqO8/default.jpg)
![GUI-based Few Shot Classification Model Trainer | Demo](https://i.ytimg.com/vi/pfwBut7E60Q/default.jpg)
![AI Agent Evaluation with RAGAS](https://i.ytimg.com/vi/-_52DIIOsCE/default.jpg)
![Input Data Pipelines - TensorFlow Essentials #4](https://i.ytimg.com/vi/f6XVfgJTbp4/default.jpg)
![How to build a Q&A Reader Model in Python (Open-domain QA)](https://i.ytimg.com/vi/-fzCSPsfMic/default.jpg)
![FREE 11 Hour NLP Transformers Course (Next 3 Days Only)](https://i.ytimg.com/vi/fA0dFQacmic/default.jpg)
![Streamlit for ML #1 - Installation and API](https://i.ytimg.com/vi/JLKUV-LiXjk/default.jpg)
![Streamlit for ML #3 - Make Apps Fast with Caching](https://i.ytimg.com/vi/lYDiSCDcxmc/default.jpg)
![Fine-tune High Performance Sentence Transformers (with Multiple Negatives Ranking)](https://i.ytimg.com/vi/or5ew7dqA-c/default.jpg)
![How-to Decode Outputs From NLP Models (Python)](https://i.ytimg.com/vi/QJq9RTp_OVE/default.jpg)
![Fast Zero Shot Object Detection with OpenAI CLIP](https://i.ytimg.com/vi/i3OYlaoj-BM/default.jpg)
![SPLADE: the first search model to beat BM25](https://i.ytimg.com/vi/0FQ2WmM0t3w/default.jpg)
![LangChain Multi-Query Retriever for RAG](https://i.ytimg.com/vi/VFf8XJUIHnU/default.jpg)
![Intro to APIs in Python - API Series #1](https://i.ytimg.com/vi/g_yMowQikOE/default.jpg)
![Stoic Philosophy Text Generation with TensorFlow](https://i.ytimg.com/vi/Jk1YP4Y_U_0/default.jpg)
![Identify Stocks on Reddit with SpaCy (NER in Python)](https://i.ytimg.com/vi/TCZgXFPNnbc/default.jpg)
![Implementing Filters in the New Haystack Doc Store](https://i.ytimg.com/vi/j3psNM5y-eA/default.jpg)
![Q&A Document Retrieval With DPR](https://i.ytimg.com/vi/DBsxUSUhfRg/default.jpg)
![Streamlit for ML #5.3 - Publishing Components to Pip](https://i.ytimg.com/vi/lZ2EaPUnV7k/default.jpg)
![Functional API - TensorFlow Essentials #2](https://i.ytimg.com/vi/BYbJ_HH788U/default.jpg)