3 Comments

One problem with pg_vector hnsw index implementation is that fetches candidates based on distance **before** applying any other filters from `WHERE`. I would recommend checking lantern and ts_vector which are both faster and more precise.

Expand full comment

I did a similar project to learn more about LLMs to build a chat bot that is powered by an author's diary. I vectorized 40k of his journal entires, when you ask a question it queries for the top 10 related articles to use as context to answer your question. I found the process helpful to learn more about RAG and make LLMs more useful. More details here - https://stevebarbera.medium.com/building-rankobot-with-chatgpt-and-laravel-dd69088211d9

Expand full comment