Configuring Vector Stores
Aizen provides vector store services. A vector store database is installed as part of the Aizen infrastructure components. See Installing the Infrastructure Components. Configure and start a vector store deployment to provide vector store services.
To configure a vector store on Aizen, follow these steps:
Log in to the Aizen Jupyter console. See Using the Aizen Jupyter Console.
Create an ML project if you have not already done so or set the current working project.
or
Configure the vector store by running the
configure vectorstore
command:In the notebook, you will be guided through a template form with boxes and drop-down lists that you can complete to configure the vector store. You must add at least one embeddings model. For each embeddings model, select the embeddings model's Provider Name, enter the Embeddings Model name, and click the Add Model button. Set one of the embeddings models as the default model by checking the Set as default model check box.
Serve the vector store running the
start vectorstore
command. This command will schedule a job to deploy the vector store. Optionally, you can configure resources for the job by running theconfigure resource
command. If you do not configure resources, default resource settings will be applied.Check the status of the vector store's deployment job and obtain serving URLs:
The base URL in the status output supports a REST API that lists the vector stores that are currently being served. The endpoint URL in the status output supports vector store operations, such as creating store IDs, uploading documents, and retrieving semantically similar texts from a store ID. Both of these URLs provide the FastAPI docs, Redoc, and OpenAPI paths.
Use the endpoint URL to create a Store ID in the vector store and to upload documents to a Store ID.
Add a vector-store cloud provider with the base URL. You will need the provider name, vector store name, and the Store ID when configuring a RAG tool for an AI Agent.
Last updated