🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
Updated Dec 24, 2025 - TypeScript
Turn any website into clean data pipelines & structured APIs in minutes!
Local-first, open-source AI assistant for your data. Unify tasks, notes, docs, photos, and bookmarks. Private, self-hosted, and extensible via APIs.
⚡️ Next-generation data transformation framework for TypeScript that puts developer experience first
An n8n custom node that integrates HeadlessX, enabling headless browser automation (navigation, scraping, screenshots, PDF generation) directly inside your n8n workflows.
A TypeScript module that provides a utility for extracting data from an OFX file in Node.js and the browser
High-performance web crawler API optimized for LLMs. Turn any search or website into clean Markdown using remote browsers.
Apify actor extracting data from Zalando
Web scraping tool used to extract real estate information from OnTheMarket.com, a leading property portal in the United Kingdom.
Automated Data Extraction and invoice management application
A powerful web scraping library built with Playwright that provides a declarative, step-by-step approach to web automation and data extraction.
A rule-driven engine designed for seamless extraction of data from JavaScript files.
A CLI tool that extracts unstructured data from websites using customizable schemas and Google's Gemini API, and outputs it in structured form.
Real-time Google Search API for AI Agents & RAG pipelines. Get structured SERP data instantly using remote browsers.
PDF data extraction parsers that get published onto npm. Standalone, but run in conjunction with the openlawnz-pipeline.
📦 A tool that can recognize and infer data types and semantic meaning from strings.
Scrape and extract information from multiple data sources with ease.
A Cloudflare Worker service that converts web pages (URLs) into MAGI (Markdown for AI) format, enriching content with YAML frontmatter metadata and AI scripts for better AI interaction and understanding.
Providing structure to news. Simple transformation. Instant exports. A treasure trove of data at your fingertips.