Embeddings are a good starting point for the…

Apr 17

Vector embeddings have been an Overton window shifting experience for me, not because they’re sufficiently advanced technology indistinguishable from magic, but the opposite. Once I started using them, it felt obvious that this was what the search experience was always supposed to be: less “How did you do that?” and more mundanely, “Why isn’t this everywhere?”

Read →

3 Comments

Vladimir

Apr 17

One problem with pg_vector hnsw index implementation is that fetches candidates based on distance **before** applying any other filters from `WHERE`. I would recommend checking lantern and ts_vector which are both faster and more precise.

Expand full comment

Adrian

Adrian’s Substack

Apr 17

https://medium.com/@adrian.white/cosine-similarity-in-snowflake-ove-eed3b57f4e6f

Expand full comment

Steve B

Apr 17

I did a similar project to learn more about LLMs to build a chat bot that is powered by an author's diary. I vectorized 40k of his journal entires, when you ask a question it queries for the top 10 related articles to use as context to answer your question. I found the process helpful to learn more about RAG and make LLMs more useful. More details here - https://stevebarbera.medium.com/building-rankobot-with-chatgpt-and-laravel-dd69088211d9

Expand full comment

Bryant’s Newsletter

Embeddings are a good starting point for the…