gRPC can be used through Vertex Prediction private endpoint, but it is not yet officially supported. See sample here: https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/community/vertex_endpoints/optimized_tensorflow_runtime/tabular_optimized_online_prediction.ipynb
Answer from Aleksey Vlasenko on Stack OverflowVertex AI: Getting a GRPC Exception when sending a prediction request in Java
Vertex AI on GCP(Google Cloud)
Vertex AI: Getting a GRPC Exception when sending a prediction request in Java
google cloud platform - How to make a prediction to a private Vertex AI endpoint with Node.js client libraries? - Stack Overflow
Me and my team have been developing a RAG app using Pinecone for vector DB, AWS for storage , google cloud for Gemini API , Digital Ocean for web admin hosting
There was a lot of third parties involved and we decided to use one platform that offers all services at once to prevent latency and cross platform billing but to also leverage its robust AI(GCP) , and so we opted for GCP and it was my decision , so looking into replacing all these technologies to use GCP , we found out that using vector search for vertex AI could replace pinecone , so i created the index and also deployed it before the developer responsible for deploying the ai-service could do the testing for the index .
So google could charge us over 100USD per day before anything worked out , not even anything was put to the created index
So we deactivated and now wondering what the issue was
I come here to seek help and advice regarding RAG using the vector search index for vertex AI , is there anyone who has used it and they were successful without being overcharged , what are its advantages and disadvantages faced