search: allow searching in a range of INSPIRE IDs or publication record IDs #791

Open
GraemeWatt opened this issue May 15, 2024 · 0 comments
Labels: complexity: medium, priority: high, type: bug

Issue #390 describes a limit of 10k publications in search results. As a workaround, a search intended to retrieve all records could be split into multiple searches over ranges of an identifier such as the inspire_id or the recid, as suggested for INSPIRE-HEP.
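A minimal sketch of how such a split could be generated on the client side. The helper name `range_queries` and the ID bounds are hypothetical, and the sketch assumes each identifier belongs to at most one publication, so a chunk of 10000 IDs cannot exceed the 10k limit.

```python
def range_queries(field, first_id, last_id, chunk_size):
    """Yield range query strings that together cover [first_id, last_id].

    Hypothetical helper: it only builds query strings and does not
    contact any search service.
    """
    if chunk_size < 1:
        raise ValueError("chunk_size must be positive")
    for low in range(first_id, last_id + 1, chunk_size):
        # Both bounds are inclusive, matching the [low TO high] syntax.
        high = min(low + chunk_size - 1, last_id)
        yield f"{field}:[{low} TO {high}]"


queries = list(range_queries("inspire_id", 1, 25000, 10000))
for query in queries:
    print(query)
```

The ranges are contiguous and non-overlapping, so each identifier falls into exactly one query.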

However, searching for a range like inspire_id:[1 TO 10000] does not currently work for HEPData because the inspire_id is stored as text in the index (see also #301). A custom query should be implemented that converts the inspire_id to an integer.
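To illustrate why a text-typed identifier can give wrong range results, the sketch below compares a lexicographic range check with a numeric one in plain Python. It assumes that a range over a text field compares values as strings; the issue does not state the exact failure mode, and the function names are hypothetical.

```python
def in_range_as_text(value, low, high):
    """Range check that compares the stored value as a string."""
    return str(low) <= value <= str(high)


def in_range_as_int(value, low, high):
    """Range check after converting the stored value to an integer."""
    return low <= int(value) <= high


# "1500" sorts between "100" and "200" as text, but 1500 is not in [100, 200].
print(in_range_as_text("1500", 100, 200), in_range_as_int("1500", 100, 200))

# "2" sorts after "10000" as text, but 2 is in [1, 10000].
print(in_range_as_text("2", 1, 10000), in_range_as_int("2", 1, 10000))
```

Under this assumption, converting the inspire_id to an integer before comparing removes both the false match and the missed match.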

Moreover, searching for a range like recid:[100 TO 200] returns records where the table recid is within the range even if the publication recid is not. This means it would not be possible to get all records (avoiding duplicates) from a combination of recid searches. Implementing a new search query like publication_recid:[100 TO 200] that does not match the table recid would be useful to avoid this problem.
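The duplicate problem can be shown with a small in-memory model. The records, the `table_recids` field and both matching functions below are hypothetical; they only mimic the behaviour described above and are not the HEPData index schema.

```python
# Hypothetical publications, each with the recids of its tables.
publications = [
    {"recid": 90, "table_recids": [91, 180]},
    {"recid": 150, "table_recids": [151, 152]},
    {"recid": 250, "table_recids": [251]},
]


def match_recid(pub, low, high):
    """Described behaviour: publication OR any table recid is in range."""
    ids = [pub["recid"], *pub["table_recids"]]
    return any(low <= i <= high for i in ids)


def match_publication_recid(pub, low, high):
    """Proposed behaviour: only the publication recid is tested."""
    return low <= pub["recid"] <= high


ranges = [(1, 100), (101, 200)]
by_recid = [
    [p["recid"] for p in publications if match_recid(p, lo, hi)]
    for lo, hi in ranges
]
by_publication_recid = [
    [p["recid"] for p in publications if match_publication_recid(p, lo, hi)]
    for lo, hi in ranges
]
print(by_recid)
print(by_publication_recid)
```

In this model, publication 90 appears in both recid ranges because one of its tables has recid 180, whereas the publication-only check places every publication in exactly one range.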

GraemeWatt added the type: bug, priority: high and complexity: medium labels on May 15, 2024
Projects
Status: In Progress