Fast lexical search library implementing BM25 in Python using SciPy (on average 2x faster than Elasticsearch in a single-threaded setting)
Updated Jul 8, 2024 - Python
This project implements an in-memory search engine for indexing and retrieving documents from a CSV file using Python and NLTK. It preprocesses text, builds an inverted index, and ranks documents based on relevance to a query using the Okapi BM25 algorithm.
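The pipeline described above (preprocess, build an inverted index, rank with Okapi BM25) can be sketched roughly as follows. This is a minimal illustration, not that project's code: CSV loading is omitted, `tokenize` is a simplified stand-in for NLTK preprocessing, the function names are hypothetical, the values `k1=1.5` and `b=0.75` are assumed common defaults, and the IDF uses a smoothed non-negative variant.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Simplified stand-in for NLTK preprocessing (assumption):
    # lowercase the text and split on whitespace.
    return text.lower().split()


def build_index(docs):
    """Build an inverted index and record document lengths.

    Returns (index, lengths), where index maps
    term -> {doc_id: term frequency} and lengths[doc_id] is the token count.
    """
    index = defaultdict(dict)
    lengths = []
    for doc_id, text in enumerate(docs):
        tokens = tokenize(text)
        lengths.append(len(tokens))
        for term, tf in Counter(tokens).items():
            index[term][doc_id] = tf
    return index, lengths


def bm25_search(query, index, lengths, k1=1.5, b=0.75):
    """Return [(doc_id, score), ...] sorted by descending BM25 score."""
    n_docs = len(lengths)
    avgdl = sum(lengths) / n_docs
    scores = defaultdict(float)
    for term in tokenize(query):
        postings = index.get(term)
        if not postings:
            continue  # term does not occur in any document
        df = len(postings)
        # Smoothed IDF that stays non-negative even for very common terms.
        idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
        for doc_id, tf in postings.items():
            # Length normalisation: longer-than-average documents are penalised.
            norm = tf + k1 * (1 - b + b * lengths[doc_id] / avgdl)
            scores[doc_id] += idf * tf * (k1 + 1) / norm
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

Calling `build_index` once on the list of document strings and then `bm25_search` per query returns only documents that share at least one term with the query, best match first.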
A basic and intuitive Python module for a (Vector Space) IR system. (Focuses on simplicity and understandability.)
IR system built upon a corpus of open-access research papers. It ranks results using the Okapi BM25 algorithm
Repository containing the final project for the Information Retrieval course at DSSC Master Degree (UniTS).
A two-stage information retrieval model using a baseline TF-IDF model and a refined BM25 model.
A search engine which takes keywords as queries and retrieves a ranked list of results
A search engine that takes keyword queries as input and retrieves a ranked list of relevant results as output. It scrapes a few thousand pages starting from one of the seed Wiki pages and uses Elasticsearch as its full-text search engine.
IR ranking system based on Okapi BM25 and blind feedback
Well-tested implementation of the OkapiBM25 algorithm. Install the npm package!
A detailed study on enhancing the working of an Automated Question Generation & Answering system in a real-time environment. Also, the paper gives a glimpse of bringing this system to freeware like WhatsApp.
Ranked document retrieval on a large text corpus.
The project is an extension of the SENT2IMG application: an attention mechanism is introduced to obtain precise captions, and the Okapi BM25 algorithm is used to rank the captions.
Content-specific search engine that aims to retrieve movie information matching the content of the user's query.
Building a search engine (parsing the collection, index + inverted index, ordering, ranking)
Parse HTML pages. Create inverted index. Search for pages
Tunable full-text search engine in JavaScript that: (1) works natively in web apps like Express.js; (2) is easy to customize (via BM25) for specific types of documents (e.g. tweets, scientific journals); (3) is deployable on either the client side or the server side.
Man page search engine using the vector space model and BM25.