pdf-search

Here are 21 public repositories matching this topic...

athrael-soju / Snappy

🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the project? Drop a star! ⭐

python docker typescript computer-vision nextjs document-retrieval rag fastapi vector-search document-understanding pdf-search vector-database vision-ai qdrant colpali multimodal-ai multivector-search deepseek-ocr visual-retrieval

Updated Dec 23, 2025
Python

jina-ai / jina-vdr

Star

Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval

embeddings multi-modal pdf-search visual-document-retrieval vidore

Updated Aug 4, 2025
Python

njmarko / googolplex-pdf-search

Star

Python program for searching pdf text, ranking the results and exporting highlighted search results in pdf. Uses trie structure, stack, heap, page graph. Converts queries to postfix notation. Allows for logical expressions and phrases. Offers did you mean functionality.

autocomplete stack graph trie heap pdf-generation didyoumean datastructures-algorithms postfix-evaluation pdf-highlighter pdf-search

Updated Aug 28, 2024
Python

ai-naymul / DocuVisQA

Star

DocuVisQA(Document Visual Question Answering) is a Python project that leverages Google's Generative AI and Langchain for document processing, text splitting, and question answering. It also supports image processing with Streamlit for interactive UI.

python open-source pdf chatbot document image-recognition streamlit pdf-search documentretrieval-exe streamlit-application langchain langchain-python

Updated Apr 8, 2024
Python

FelixKohlhas / pdf_search

Star

A web interface that allows searching for PDFs by their content

pdf flask sqlite pdf-search

Updated Nov 30, 2023
Python

raisultan / hermes

Star

Use semantic search on PDFs locally

embeddings semantic-search pdf-search

Updated Mar 30, 2024
Python

eli64s / pdflex

Sponsor

Star

CLI for merging PDF contexts.

pdf-converter pdf-document pdf-generator pdf-manipulation pdf-extractor pdf-library pdf-parser pdf-data-extraction pdf-processor pdf-tools pdf-document-processor python-pdf pdf-search pdf-text-extraction pdf-python pdf-automation python-pdf-tools pdf-document-parser pdf-regex

Updated Mar 20, 2025
Python

Ashad001 / UltimateRAG

Star

In Development

machine-learning ai embeddings gemini openai web-search rag pdf-search jina llms llama-index vector-store

Updated Jul 30, 2024
Python

shreyansh-kothari / PDF-Querying-using-TF-IDF-from-Scratch

Star

Given a set of PDFs and the query, the most relevant pdf can be found with the help of TF-IDF. The code has not used any library to implement TF-IDF

python glob pdf-converter python3 tf-idf querying pdfminer document-search pdf-search

Updated Oct 15, 2019
Python

M-Husnain-Ali / Cognivia-AI

Star

Cognivia AI is a powerful AI-powered PDF search and question-answering system built with LangChain, Pinecone Vector Store, OpenAI, and Supabase. Upload PDFs, ask questions, and get intelligent answers with persistent conversation memory.

embedded-systems openai question-answering semantic-search text-embedding pinecone streamli rag pdf-search vector-database ai-chatbot supbase langchain intelligent-document-processing chat-with-pdf pdf-rag

Updated Sep 8, 2025
Python

aemal / pdf-finder

Star

A tool to search for text in PDF files using multiple methods, including OCR (Optical Character Recognition).

ocr pdf-search pdf-finder search-in-pdf

Updated Apr 23, 2025
Python

Sazizi2025 / PDF-Founder

Star

Are you short on time?! Can't you search all the PDFs one by one for the content you want?! Well, PDF-Founder is here...

python pdf gui image tesseract rgb graphical tesseract-ocr easy-to-use image-generator snipping pdf-search-engine pymupdf pysimplegui pdf-search ptl pymupdf-fitz

Updated Jan 8, 2024
Python

sampconrad / busca-diario

Sponsor

Star

Programa que busca uma lista de nomes das Partes Processuais nos PDFs do Diário Oficial.

python law brasil pdf-search

Updated Dec 19, 2023
Python

braendma / PDF-finder

Star

This Python script allows users to search through PDF documents located in predefined directories for specific keywords. It uses PyPDF2 to extract text from PDFs and supports single or dual keyword searches.

python3 pdf-search-engine research-assistant pdf-search

Updated Aug 19, 2025
Python

ad4529 / unichemfinder

Star

Repository for the Indexing, Search and Evaluation of UniChemFinder

pdf-search granular-search chemical-ir

Updated Apr 21, 2025
Python

MilanSazdov / search-engine-pdf

Star

📄 PDF Search Engine – Advanced keyword-based PDF search with logical operators, graph-based ranking, autocomplete, and highlighted exports.

python search-engine pdf information-retrieval ranking-algorithm pdf-search

Updated Feb 25, 2025
Python

sara-stojkov / Python_PDF_Search_Engine

Star

Python console app that uses smart searching through the provided PDF. It showcases the use of tries for word searching.

python graph trie python3 pdf-search-engine pdf-search

Updated Nov 21, 2024
Python

logxdx / contextualized-late-interation-with-pdfs

Star

A high-performance RAG system for PDFs using multi-vector embeddings (ColPali / ColQwen / ColSmol) with vector search in Qdrant, prefetch optimization, and reranking for improved relevance. Designed for speed, accuracy, and scalability, this system is ideal for building intelligent search, document understanding, and QA applications.

rag pdf-search colpali pdf-rag colqwen2 colsmol

Updated Sep 12, 2025
Python

TheharshVardhan01 / RAGvisor

Star

An AI-powered Streamlit app for PDF and web-based Q&A using RAG (Retrieval-Augmented Generation), Groq’s Mixtral LLM, and DeepAI image generation.

nlp application openai chunking rag groq deepai streamlit pdf-search ai-applications imagegeneration stable-diffusion llms chromadb llm-inference genai-chatbot rag-chatbot

Updated Jul 27, 2025
Python

manulthanura / CrewAI-Doc-Search-Tool

Star

Build a workflow using CrewAI tools to scrape the content from the docs and then perform RAG on it.

google gemini pdf-search crewai rag-chatbot doc-search

Updated Aug 4, 2025
Python

Improve this page

Add a description, image, and links to the pdf-search topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pdf-search topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pdf-search

Here are 21 public repositories matching this topic...

athrael-soju / Snappy

jina-ai / jina-vdr

njmarko / googolplex-pdf-search

ai-naymul / DocuVisQA

FelixKohlhas / pdf_search

raisultan / hermes

eli64s / pdflex

Ashad001 / UltimateRAG

shreyansh-kothari / PDF-Querying-using-TF-IDF-from-Scratch

M-Husnain-Ali / Cognivia-AI

aemal / pdf-finder

Sazizi2025 / PDF-Founder

sampconrad / busca-diario

braendma / PDF-finder

ad4529 / unichemfinder

MilanSazdov / search-engine-pdf

sara-stojkov / Python_PDF_Search_Engine

logxdx / contextualized-late-interation-with-pdfs

TheharshVardhan01 / RAGvisor

manulthanura / CrewAI-Doc-Search-Tool

Improve this page

Add this topic to your repo