ads
Monday, August 14, 2023
Show HN: Epsilla – Open-source vector database with low query latency https://ift.tt/GJnsKV2
Show HN: Epsilla – Open-source vector database with low query latency Hey HN! We are building Epsilla ( https://ift.tt/BcriPlj ), an open-source, self-hostable vector database for semantic similarity search that specializes in low query latency. When do we need a vector database? For example, GPT-3.5 has a 16k context window limit. If we want to let it answer a question about a 300 page book, we cannot put the whole book content into the context. We have to choose the sections of the book that are most relevant to the question. Vector database is specialized at ranking and picking the most relevant content from a large pool of documents based on their semantic similarity. Most vector databases utilize hierarchical navigational small world (HNSW) for indexing the vectors for high precision vector search, and its latency significantly degrades when the precision target is higher than 95%. At a previous company, we worked on building the parallel graph traversal engine. We realized that the bottleneck of HNSW performance is because there are too many sequential traversal steps that don't fully leverage multi-core CPU computation resources. After some research, we found that there are algorithms such as SpeedANN that are targeting this problem, which is not leveraged by industry yet. So we built the Epsilla vector database to turn the research into a production system. With Epsilla, we shoot for 10x lower vector search latency compared to HNSW based vector databases. We did an initial benchmark against the top open source vector databases: https://ift.tt/d0sOaiC... We provide a Docker image for you to install Epsilla backend locally, and provide a Python client and a JavaScript client to connect and interact with it. Quickstart: docker pull epsilla/vectordb docker run --pull=always -d -p 8888:8888 epsilla/vectordb pip install pyepsilla git clone https://ift.tt/Sb2OFlk cd examples python hello_epsilla.py We just started a month ago. We'd love to hear what you think, and more importantly, what you wish to see in the future. We are thinking about a serverless vector database on cloud with a consumption based pricing model, and we are eager to get your feedback. https://ift.tt/BcriPlj August 14, 2023 at 10:21PM
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment