Skip to content
#

information-retrieval

Here are 2,417 public repositories matching this topic...

WebDiver is a versatile Python script for crawling websites, extracting internal and external links, titles, and descriptions. It's useful for tasks such as web analysis, OSINT (Open Source Intelligence) gathering, and competitive analysis.

  • Updated Jul 3, 2024
  • Python

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

  • Updated Jul 3, 2024
  • Cuda
weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

  • Updated Jul 2, 2024
  • Go
haystack

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

  • Updated Jul 2, 2024
  • Python

Improve this page

Add a description, image, and links to the information-retrieval topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the information-retrieval topic, visit your repo's landing page and select "manage topics."

Learn more