We would like to get a better understanding of vector search capabilities offered by OpenSearch.
The goal of this task is to spike a PoC, running locally on a small wiki, that integrates OpenSearch with the outlink model developed by Research.
Some questions we want to answer:
- What will mappings look like for docs?
- Is vector search part of the vanilla OpenSearch, or would we need additional plugins?
- What would a query look like? Where do we retrieve embeddings from?
- Update relforge indices with an embeddings field to enable vector search
- How does vector search compare to "more like"?
- Can we leverage LLMs to compare sets of recommendations?
WIP code to support this work is available at: https://gitlab.wikimedia.org/gmodena/vector_search
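
As a starting point for the mapping and query questions above, here is a minimal sketch of what the request bodies might look like. It is not taken from the repository linked above. It assumes the OpenSearch k-NN plugin is available on the cluster and uses its `knn_vector` field type and `knn` query clause. The field name `embedding`, the `title` field, and the dimension of 4 are hypothetical placeholders; the real dimension depends on the outlink model's output.

```python
import json

# Hypothetical placeholders: the real field name and dimension depend on
# the index layout and on the size of the outlink model's embeddings.
EMBEDDING_FIELD = "embedding"
EMBEDDING_DIM = 4


def build_index_body(dim=EMBEDDING_DIM):
    """Index creation body with a k-NN enabled vector field."""
    return {
        # Enables k-NN search structures for this index (k-NN plugin setting).
        "settings": {"index": {"knn": True}},
        "mappings": {
            "properties": {
                "title": {"type": "text"},
                EMBEDDING_FIELD: {"type": "knn_vector", "dimension": dim},
            }
        },
    }


def build_knn_query(vector, k=5):
    """Approximate nearest-neighbour query for a given embedding."""
    return {
        "size": k,
        "query": {
            "knn": {
                EMBEDDING_FIELD: {"vector": list(vector), "k": k},
            }
        },
    }


if __name__ == "__main__":
    # Print the JSON bodies that would be sent to the cluster.
    print(json.dumps(build_index_body(), indent=2))
    print(json.dumps(build_knn_query([0.1, 0.2, 0.3, 0.4], k=3), indent=2))
```

The sketch only builds the JSON bodies, so it runs without a cluster. The first body would be sent when creating the index and the second to its `_search` endpoint. Where the query vector comes from (precomputed per page, or computed at query time) is one of the open questions listed above.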