Algorithm for pre-filtering locations and other similar strings during vector search.
We built several vector databases of World Bank project documents, with multiple location, topic, and theme metadata fields, using Marqo, AWS Kendra, Weaviate, and Pinecone. Typical queries look like:
What are some lessons learned from rural water supply projects in Northern Africa?
What are the effects on girls' education due to sanitation projects in Asia and South America?
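
One possible way to pre-filter on location is to match the query text against the known values of a location metadata field and pass the matches as a filter alongside the vector query. The sketch below is only an illustration under assumptions: the vocabulary KNOWN_REGIONS, the field name "region", and the shape of the filter dictionary are all hypothetical, and each of the databases above has its own filter syntax that this dictionary would have to be translated into.

```python
import re

# Hypothetical vocabulary: the distinct values stored in a "region" metadata field.
KNOWN_REGIONS = ["Northern Africa", "Africa", "Asia", "South America", "America"]


def extract_regions(query, vocabulary):
    """Return vocabulary entries found in the query, longest terms first."""
    found = []
    remaining = query.lower()
    # Try longer terms first so "Northern Africa" wins over "Africa".
    for term in sorted(vocabulary, key=len, reverse=True):
        pattern = r"\b" + re.escape(term.lower()) + r"\b"
        if re.search(pattern, remaining):
            found.append(term)
            # Blank out the match so shorter terms nested in it are not matched too.
            remaining = re.sub(pattern, " ", remaining)
    return found


def build_filter(regions):
    """Build a generic, database-agnostic filter (assumed shape, not a real API)."""
    if not regions:
        return None  # No location found: run the vector search unfiltered.
    return {"region": {"any_of": regions}}


if __name__ == "__main__":
    q = "What are the effects on girls' education due to sanitation projects in Asia and South America?"
    print(build_filter(extract_regions(q, KNOWN_REGIONS)))
```

Matching longest terms first is a deliberate choice here: it keeps a query about "Northern Africa" from also being filtered to all of "Africa". Exact string matching like this will miss paraphrases and misspellings, so it is best read as a baseline rather than a complete solution.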