Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PCA as post-processing #94

Open
AlekseySh opened this issue Aug 14, 2022 · 0 comments
Open

PCA as post-processing #94

AlekseySh opened this issue Aug 14, 2022 · 0 comments
Labels

Comments

@AlekseySh
Copy link
Contributor

AlekseySh commented Aug 14, 2022

  1. Make an example on mock dataset and put it in the Docs here. It should look like this:
embeddings = inference(model, val, batch_size=4, num_workers=0)
rr = RetrievalResults.from_embeddings(embeddings, val, n_items=3)

embeddings_upd = PCA()(embeddings)
rr_upd = RetrievalResults.from_embeddings(embeddings_upd, val, n_items=3)

# after that we compare results visually and/or by metrics
  1. Check if it boosts metrics on one of our benchmarks. Check how much the retrieval time is improved.

  2. Write a shor report below the example on the same Doc's page.

PS. OML already has a PCA implementation, so, no need to bring an extra requirement.

@AlekseySh AlekseySh added this to To do in OML-planning via automation Aug 14, 2022
@AlekseySh AlekseySh moved this from To do to backlog in OML-planning Aug 14, 2022
@AlekseySh AlekseySh removed the good first issue Good for newcomers label Nov 20, 2022
@AlekseySh AlekseySh added the good first issue Good for newcomers label May 31, 2023
@AlekseySh AlekseySh changed the title PCA as a part of PostProcessor PCA as post-processing Jun 7, 2024
@AlekseySh AlekseySh moved this from backlog to To do in OML-planning Jun 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Development

No branches or pull requests

1 participant