From v1.5.3 to v1.6.3, GPU memory usage has increased when adding vectors to an IVF index, and then never querying. In v1.5.3 GPU memory use was constant. Now it grows linearly with number of vectors.