Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unsupervised PQ results #4

Open
hshreeshail opened this issue Jan 27, 2022 · 1 comment
Open

Unsupervised PQ results #4

hshreeshail opened this issue Jan 27, 2022 · 1 comment

Comments

@hshreeshail
Copy link

In table-1, is there any explanation for why the results of unsupervised PQ (MRR@10 = 0.028 @ compression ratio = 64x) so poor?

In our experience, PQ works reasonably well. For example, when we use PQ to compress vectors (768 dim) generated by the ANCE model with M=32 (compression ratio = 96x), we get MRR@10 = 0.252 on MS MARCO Passage Dev. We used IndexPQ from the FAISS library for the same.

Also, when reporting results on unsupervised methods (PQ, ScaNN, OPQ etc), what is the encoder of input uncompressed vectors? Is it the trained STAR model?

@jingtaozhan
Copy link
Owner

[1] According to my experience, PQ performs rather poorly and it is important to use OPQ. This is the script we use for OPQ. You can change this line to get the PQ results. Happy to know how you get such better PQ performance.
[2] Yes, the encoder is STAR model.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants