Limiting attention radius and extracting embeddings #70
Unanswered
george-henderson
asked this question in Q&A
Replies: 1 comment
-
This is not possible with the current model.
Here's an example of extracting the last hidden layer outputs from the model: #32
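For illustration, a common way to tap the last hidden layer in a PyTorch model is a forward hook. This is a minimal sketch with a toy stand-in model; the real model's module names and layer layout will differ, so treat the layer path here as a placeholder:

```python
import torch
import torch.nn as nn

# Toy stand-in for the real model (hypothetical architecture;
# substitute the actual model and pick its last hidden layer).
model = nn.Sequential(
    nn.Embedding(100, 16),  # token embeddings
    nn.Linear(16, 16),      # "hidden" layer we want to tap
    nn.Linear(16, 100),     # output head over the token vocabulary
)

captured = {}

def save_hidden(module, inputs, output):
    # Store the hidden representation on every forward pass.
    captured["hidden"] = output.detach()

# Attach the hook to the layer whose output we want.
handle = model[1].register_forward_hook(save_hidden)

tokens = torch.randint(0, 100, (2, 8))  # (batch, seq_len)
logits = model(tokens)                  # (batch, seq_len, num_tokens)
hidden = captured["hidden"]             # (batch, seq_len, hidden_dim)
handle.remove()                         # detach the hook when done
```

The hook fires on every forward call, so remove it once the embeddings are collected.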
-
Hello,
Is it possible to alter the model's attention radius, so that the model only applies attention within a certain window of the input?
A second question: can you advise on how I might extract the embeddings from the model? I am using the model out of the box, so the final output is a tensor of dimension num_batches × input_length × num_tokens, but I'd like to access the internal latent-space representation of my text as well.
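For reference, this kind of local attention is usually implemented with a banded mask that zeroes attention weight outside the window before the softmax. A minimal sketch (generic PyTorch, not this model's API):

```python
import torch

def local_attention_mask(seq_len: int, radius: int) -> torch.Tensor:
    # True where position j lies within `radius` of position i.
    idx = torch.arange(seq_len)
    return (idx[None, :] - idx[:, None]).abs() <= radius

mask = local_attention_mask(6, 1)  # each token sees its neighbors only

# Apply to raw attention scores: positions outside the band get -inf,
# so the softmax assigns them zero attention weight.
scores = torch.randn(6, 6)
scores = scores.masked_fill(~mask, float("-inf"))
weights = scores.softmax(dim=-1)
```

Whether the model exposes a hook for such a mask depends entirely on its implementation.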
Thank you!