Allow semantic similarity computation using only a certain hierarchy #119

Open
hlapp opened this issue Nov 22, 2019 · 3 comments
Labels
enhancement New feature or request

Comments

@hlapp
Member

hlapp commented Nov 22, 2019

By default, the subsumer matrix contains all subsumers, and thus the semantic similarity metrics computed with it use the entire superclass hierarchy.

For phenotype dependency and mutual exclusivity evaluation, it is sometimes useful to consider similarity only for a certain part of the superclass hierarchy. Specifically, it should be possible to consider only the parthood hierarchy.

@hlapp
Member Author

hlapp commented Nov 22, 2019

One possibility to accomplish this on the client (RPhenoscape) side would be to remove (filter) from the subsumer matrix those subsumers that aren't in the desired hierarchy. If we can identify terms in the desired hierarchy (or their complement(s)) by, for example, a subclass query, then this would be an effective filter.

Note that filtering the subsumer matrix in this way assumes that rows in the matrix are independent. This should be true for Jaccard, but should be checked when weights are not in {0, 1}.
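
To make the filtering idea concrete, here is a minimal sketch in plain Python rather than RPhenoscape. It assumes a subsumer matrix with one row per subsumer and one column per term (1 if the row class subsumes the column term). The class labels, the term names `t1` and `t2`, and the helpers `filter_subsumers` and `jaccard` are all hypothetical. The prefix test is only a stand-in for whatever subclass query would identify the desired hierarchy.

```python
# Toy subsumer matrix (made-up data): subsumers[cls][term] == 1 iff cls subsumes term.
subsumers = {
    "part_of some fin":          {"t1": 1, "t2": 1},
    "part_of some pectoral fin": {"t1": 1, "t2": 0},
    "has_quality some shape":    {"t1": 1, "t2": 0},
    "has_quality some size":     {"t1": 0, "t2": 1},
}

def filter_subsumers(matrix, allowed):
    """Drop every row whose subsumer is not in the allowed set."""
    return {cls: row for cls, row in matrix.items() if cls in allowed}

def jaccard(matrix, term_a, term_b):
    """Shared subsumers divided by all subsumers of either term."""
    both = sum(1 for row in matrix.values() if row[term_a] and row[term_b])
    either = sum(1 for row in matrix.values() if row[term_a] or row[term_b])
    return both / either if either else 0.0

# Stand-in for a subclass query that identifies the parthood hierarchy.
parthood = {cls for cls in subsumers if cls.startswith("part_of")}

full = jaccard(subsumers, "t1", "t2")
restricted = jaccard(filter_subsumers(subsumers, parthood), "t1", "t2")
print(full, restricted)
```

Because each row contributes independently to the intersection and union counts, dropping rows is enough for Jaccard. A weighted metric would need its weights checked after filtering, as noted above.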

@hlapp hlapp added the enhancement New feature or request label Nov 22, 2019
@hlapp
Member Author

hlapp commented Nov 22, 2019

@balhoff feel free to comment.

@hlapp
Member Author

hlapp commented Nov 4, 2022

I think that if we wanted to accomplish this, it would have to be through a feature of the KB API: either a parameter specifying the allowable relationships for subsumption paths, or a named corpus defined as using only certain relationships for the transitive closure.

@balhoff can you remind us whether there's any work underway for supporting this, and whether this should be available before the TraitFest event?
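
For illustration only, restricting the transitive closure to certain relationships could look like the sketch below. This is not the KB API and says nothing about how it would implement such a parameter. The edge list, the term labels, and the `ancestors` helper are hypothetical toy data and are not taken from any ontology.

```python
from collections import deque

# Toy graph (made-up data): term -> list of (relation, parent) edges.
edges = {
    "pectoral fin ray": [("part_of", "pectoral fin"), ("is_a", "fin ray")],
    "pectoral fin": [("part_of", "pectoral girdle region"), ("is_a", "paired fin")],
    "fin ray": [("is_a", "skeletal element")],
}

def ancestors(term, edges, allowed_relations):
    """Collect every term reachable from `term` over allowed relations only."""
    seen = set()
    queue = deque([term])
    while queue:
        current = queue.popleft()
        for relation, parent in edges.get(current, ()):
            if relation in allowed_relations and parent not in seen:
                seen.add(parent)
                queue.append(parent)
    return seen

parthood_only = ancestors("pectoral fin ray", edges, {"part_of"})
everything = ancestors("pectoral fin ray", edges, {"part_of", "is_a"})
print(sorted(parthood_only))
print(sorted(everything))
```

The subsumers computed with `{"part_of"}` would then be the only ones feeding into the similarity computation, which is the effect a relationship parameter or a named corpus would have.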
