Skip to content

Grouping conversation together and assigning them topics #1945

Answered by MaartenGr
yksoni asked this question in Q&A
Discussion options

You must be logged in to vote

Hi Yogesh,

Thanks for sharing your use case. This indeed seems quite difficult when you are using an embedding model that is trained for semantic similarity.

It will depend on the extent to which you want to generate topics. How abstract or specific they need to be as there are tricks to still combine the topics. For instance, you could simply combine a number of sentences using a sliding window (e.g., sentence1 + sentence2, sentence2 + sentence3, etc.) and feed those into BERTopic. That will increase semantic content of your documents and might result in better representations.

The second thing, which I am not familiar with myself, is to look into graph-based clustering methods. It seems…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by yksoni
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants