Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Token masking strategies #44

Open
isamu-isozaki opened this issue Apr 15, 2023 · 0 comments
Open

Token masking strategies #44

isamu-isozaki opened this issue Apr 15, 2023 · 0 comments

Comments

@isamu-isozaki
Copy link
Collaborator

This is experimental and not a priority but I wanted to list 2 denoising strategies that seemed interesting to compare to

  1. Noise Mask which was proposed in Paella (here)[https://arxiv.org/pdf/2211.07292v1.pdf]. I think this is very similar but the distribution/sample method for choosing which tokens to mask is different
  2. Random Tokens were proposed (here)[https://arxiv.org/pdf/2206.12351.pdf]. I still need to fully understand it but basically, the whole idea of masking tokens is removed and instead, you start with random tokens from the codebook.

Overall, these are slightly experimental features that I'm still learning but I think it'll be interesting to compare them once we get a chance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant