Scrap agents

This page lists every AI agent in the MeshKore directory tagged with the Scrap capability. Agents are sourced from public platforms (GitHub, Hugging Face, npm, PyPI, awesome-list curations, and direct submissions), normalized by the MeshKore worker, and ranked by GitHub stars. Each card links to the agent's profile with details on capabilities, framework, language, freshness, and source attribution.

505 agents in this capability · ranked by popularity
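The ranking step described above can be illustrated with a minimal sketch. This is not MeshKore's actual worker code: the `AgentRecord` type, its `source` field, the deduplication rule, and the "stale duplicate" entry are all hypothetical and exist only for illustration. The three real star counts are taken from the cards on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentRecord:
    name: str
    stars: int
    source: str  # hypothetical field, e.g. "github" or "pypi"


def parse_stars(text: str) -> int:
    """Convert a display string such as '124,884 ★' to an integer."""
    return int(text.replace("★", "").replace(",", "").strip())


def rank_by_stars(records):
    """Keep one record per (source, lowercase name), then sort by stars, highest first."""
    best = {}
    for rec in records:
        key = (rec.source, rec.name.lower())
        if key not in best or rec.stars > best[key].stars:
            best[key] = rec
    return sorted(best.values(), key=lambda r: r.stars, reverse=True)


records = [
    AgentRecord("huginn", parse_stars("49,334 ★"), "github"),
    AgentRecord("firecrawl", parse_stars("124,884 ★"), "github"),
    AgentRecord("Scrapegraph-ai", parse_stars("26,170 ★"), "github"),
    # Made-up stale duplicate, included only to exercise the dedup step.
    AgentRecord("Firecrawl", parse_stars("124,000 ★"), "github"),
]

ranked = rank_by_stars(records)
for rec in ranked:
    print(f"{rec.name}: {rec.stars:,} ★")
```

Keying on the lowercase name within a source is one simple normalization choice; a real pipeline would more likely key on a canonical repository URL.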

Top 200 Scrap agents

firecrawl124,884 ★

🔥 Search, scrape, and clean the web for AI agents.

huginn49,334 ★

Create agents that monitor and act on your behalf. Your agents are standing by!

Jobs_Applier_AI_Agent_AIHawk29,814 ★

AIHawk aims to ease the job hunt by automating the job application process. Utilizing artificial…

Scrapegraph-ai26,170 ★

Python scraper based on AI

CloakBrowser21,512 ★

Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level…

Agent-Reach20,347 ★

Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili…

deep-research18,985 ★

An AI-powered research assistant that performs iterative, deep research on any topic by combining search…

ai-website-cloner-template15,519 ★

Clone any website with one command using AI coding agents

obscura13,786 ★

The headless browser for AI agents and web scraping

llm-scraper6,749 ★

Turn any webpage into structured data using LLMs

firecrawl-mcp-server6,391 ★

🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM…

trafilatura6,010 ★

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as…

camofox-browser5,852 ★

Stealth headless browser for AI agents — bypass Cloudflare, bot detection, and anti-scraping. Drop-in…

myGPTReader4,421 ★

A community-driven way to read and chat with AI bots - powered by ChatGPT.

AnyCrawl3,173 ★

AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP…

CyberScraper-2077 3,028 ★

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

ai-crawler-py2,941 ★

Crawl a website starting from a URL, find relevant pages, and extract data – all guided by your natural…

oxylabs-ai-studio-py2,919 ★

Structured data gathering from any website using AI-powered scraper, crawler, and browser automation…

spider2,504 ★

Low latency web data collector

brightdata-mcp2,411 ★

A powerful Model Context Protocol (MCP) server that provides an all-in-one solution for public web access.

weibo_terminater2,320 ★

Final Weibo crawler: scrape anything from Weibo, including comments, post contents, and followers. The Terminator

open-agent-builder2,241 ★

🔥 Visual workflow builder for AI agents powered by Firecrawl - drag-and-drop web scraping pipelines with…

deep-research-web-ui2,183 ★

(Supports DeepSeek R1) An AI-powered research assistant that performs iterative, deep research on any topic…

linkedin-mcp-server2,014 ★

Open-source MCP server for LinkedIn. Give Claude and any MCP-compatible AI assistant access to profiles…

OpenOutreach1,933 ★

LinkedIn Automation Tool: Describe your product. Define your target market. The AI finds the leads for you.

RPA1,912 ★

Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use/LLM. Selenium IDE …

JustHireMe1,903 ★

Local-first AI job intelligence workbench for scraping roles, ranking fit, and generating tailored…

thepipe1,526 ★

Get clean data from tricky documents, powered by vision-language models ⚡

skills1,475 ★

Browser automation CLI built for AI agents. Break through anti-bot walls, hand off to humans across platforms…

agentql1,372 ★

AgentQL is a suite of tools for connecting your AI to the web. Featuring a query language and Playwright…

open-scouts1,282 ★

🔥 AI-powered web monitoring platform. Create automated scouts that search the web and send email alerts when…

apify-mcp-server1,275 ★

The Apify MCP server enables your AI agents to extract data from social media, search engines, maps…

webclaw1,198 ★

Fast, local-first web content extraction for LLMs. Scrape, crawl, extract structured data — all from Rust…

RedBox998 ★

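Create high-quality content with AI; the best image generation tool built on gpt-image-2; automatic AI image layout; an Openclaw for Xiaohongshu; an AI workbench for independent media creators; RedClaw, an AI creation tool for Xiaohongshu that supports downloading Xiaohongshu image-and-text posts, learning creative styles, and Xiaohongshu AI creation|…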

x-tweet-fetcher842 ★

Fetch X/Twitter tweets, replies, timelines, and articles without login or API keys — field tool for AI agents.

undetectable-fingerprint-browser757 ★

Free open-source Multilogin/Incogniton/Kameleo alternative for fingerprint spoofing…

firecrawl-app-examples741 ★

🔥 This repository contains complete application examples, including websites and other projects, developed…

neo720 ★

Turn any web app into an API. Chrome extension captures browser traffic, auto-generates schemas, lets AI…

Dulus703 ★

The best CLI agent - The agent who could save AI — reads & edits files, runs Bash, greps your repo, browses…

ai-scraper-py662 ★

AI Scraper is a powerful scraping tool and scrape agent built to automate data extraction with unmatched…

stealth-browser-mcp661 ★

The only browser automation that bypasses anti-bot systems. AI writes network hooks, clones UIs pixel-perfect…

gpt-promo-scanner566 ★

ChatGPT Team (Business) promo code auto-scanner: batch discovery, validation, and price collection; supports 34 codes across 17 countries, with discounts up to 71% | ChatGPT Business promo code scanner…

reader529 ★

Open source web infrastructure for AI. Scrape, crawl, and automate the web, clean markdown, browser sessions…

AutoScraper485 ★

Official implementation of the paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation"…

skills480 ★

AI agent skills for UnifAPI MCP and public-data workflows

List-of-user-agents461 ★

List of major web + mobile browser user agent strings. +1 Bonus script to scrape :)

n8n-claw447 ★

OpenClaw-inspired autonomous AI agent built entirely in n8n. Adaptive RAG-powered memory, Skills via MCP…

markdown-crawler443 ★

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page…

cli423 ★

CLI and Agent Skill for Firecrawl - Add scrape, search, and browsing capabilities to your AI agents

TheAgenticBrowser409 ★

Open-source AI agent for web automation and scraping.

scraperai405 ★

ScraperAI is an open-source, AI-powered tool designed to simplify web scraping for users of all skill levels.

resume_render_from_job_description405 ★

Resume_Builder_AIHawk is a powerful Python tool that allows you to automatically customize your resume based…

graphlit-mcp-server376 ★

Model Context Protocol (MCP) Server for Graphlit Platform

kuri324 ★

Browser automation, web crawling, and iOS + Android device control for AI agents. Zig-native, token-efficient…

extractor317 ★

Use LLMs to robustly extract web data

Aetherius_AI_Assistant314 ★

A completely private, locally-operated AI Assistant/Chatbot/Sub-Agent Framework with realistic Long Term…

ChatGPT-OpenAI-Smart-Speaker312 ★

This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice…

dendrite-python-sdk310 ★

Tools to build web AI agents that can authenticate, interact with and extract data from any website.

reader308 ★

📚 This is an adapted version of Jina AI's Reader for local deployment using Docker. Convert any URL to an…

gpt4V-scraper302 ★

AI agent that can SEE 👁️, control, navigate, & do stuff for you on your browser.

CloakBrowser297 ★

CloakBrowser GitHub: anti-detect browser download, source-level chromium patches, browser fingerprinting…

llm-reader290 ★

Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web…

knowledge-gpt290 ★

Extract knowledge from all information sources using gpt and other language models. Index and make Q&A…

agent-fetch283 ★

Full-content web fetcher for AI agents — Chrome TLS fingerprinting, browser impersonation, and …

XActions282 ★

⚡ The Complete X/Twitter Automation Toolkit — Scrapers, MCP server for AI agents (Claude/GPT), CLI, browser…

flyto-core270 ★

The open-source execution engine for AI agents. 412 modules, MCP-native, triggers, queue, versioning…

teracrawl263 ★

High-performance web crawler API optimized for LLMs. Turn any search or website into clean Markdown using…

MinerU-HTML248 ★

MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep…

lego-ai-parser239 ★

Lego AI Parser is an open-source application that uses OpenAI to parse visible text of HTML elements.

search-result-scraper-markdown239 ★

This project provides a powerful web scraping tool that fetches search results and converts them into…

searcharvester233 ★

Self-hosted search + markdown harvester for AI agents. SearXNG (100+ engines) + FastAPI + trafilatura…

camofox-browser228 ★

Anti-detection browser server for AI agents — REST API wrapping Camoufox engine with OpenClaw plugin support

pocketgroq217 ★

PocketGroq is a powerful Python library that simplifies integration with the Groq API, offering advanced…

AI-Resume-Analyzer-and-LinkedIn-Scraper-using-Generative-AI205 ★

Developed an AI application using LLM to analyze user resumes and provided the summarization, strengths…

unofficial-claude-api200 ★

Unofficial Claude API supporting direct HTTP chat creation/deletion/retrieval, messages with multiple file…

skilless.ai190 ★

skilless.ai gives your AI Agents real data capabilities - web search, web scraping, video download/subtitle…

local-deepsearch-academic180 ★

An implementation of Google Deep Search 🕵️ with support for 1000+ references, local inference, chatting with…

agentql-mcp171 ★

Model Context Protocol server that integrates AgentQL's data extraction capabilities.

geo-ai-agent164 ★

AI-powered tool to audit and optimize website content by crawling URLs, analyzing H1s, and generating…

auto-md163 ★

Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files

media-agent153 ★

Scrape data from social media and chat with it using Langchain

decipher-research-agent150 ★

Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything.

blackmaria150 ★

Python package for webscraping in Natural language

JARVIS150 ★

JARVIS: a real-time agentic intelligence-gathering platform powered by autonomous web scraping & OSINT…

BrowserPilot149 ★

Open‑source alternative to Perplexity Comet, director.ai and firecrawl combined

decodo-openclaw-skill147 ★

OpenClaw skill for scraping any URL using the Decodo Web Scraping API.

Job-apply-AI-agent146 ★

GitHub Project: AI Job Application Automation 🚀 This project automates job searching, CV creation, and…

charlotte143 ★

Token-efficient browser MCP server — structured web pages for AI agents, not raw accessibility dumps

Upwork-AI-jobs-applier143 ★

AI tool for automating Upwork job applications using AI agents to find and qualify jobs, write personalized…

real-estate-ai-agent138 ★

Intelligent Python system that extracts real estate property data as structured JSON using AI agents, Nebius…

dfcx-scrapi136 ★

A high level scripting API for bot builders, developers, and maintainers.

browser-debugger-cli135 ★

CLI tool for agents to quickly access browser telemetry (DOM, network, console) via Chrome DevTools Protocol.

web-scout-mcp129 ★

A powerful MCP server extension providing web search and content extraction capabilities. Integrates…

legion129 ★

Scrappy assistant that automates web3 bug hunting workflows. Tracks ongoing bug bounties and launches…

md125 ★

A useful drawer for macOS: chatting, clipboard, web scraping, window management, shortcuts. Built with Rust and …

company-research-agent118 ★

Automated Multi AI Agent for company research with LangGraph: scrapes web data, extracts business insights…

open-skills117 ★

Battle-tested skill library for AI agents. Save 98% of API costs with ready-to-use code for crypto, PDFs…

crw116 ★

Fast, lightweight Firecrawl alternative in Rust. Web scraper, crawler & search API with MCP server for AI…

wxpath111 ★

wxpath - declarative web crawling with XPath; a Web Query Language (WQL)

Scrapegraph-demo110 ★

Streamlit demo of Scrapegraph-ai for GPT4-hackathon

aura105 ★

AURA (Agent-Usable Resource Assertion) is an open protocol designed to make the web machine-readable. It…

sexting-dataset104 ★

Erotic conversations scraped from public resources on the internet

x-twitter-scraper96 ★

Twitter scraper API skill for tweet search, advanced Twitter search, profile tweets, follower export, media…

oxylabs-mcp95 ★

Official Oxylabs MCP integration

prosecutor-database93 ★

Civic Tech & Data AI For Good project. Tracks prosecutor election messaging, mass incarceration indicators…

crawl4ai-mcp-server89 ★

🕷️ A lightweight Model Context Protocol (MCP) server that exposes Crawl4AI web scraping and crawling…

anansi88 ★

A self-healing web scraper built for hostile sites: selectors repair themselves, browser rendering kicks in…

scrapeGPT88 ★

ScrapeGPT is a RAG-based Telegram bot designed to scrape and analyze websites, then answer questions based on…

RAG-based-job-search-assistant87 ★

linkedin-jobs-RAG

WebScraper87 ★

Python-based web crawling script with randomized intervals, user-agent rotation, and proxy server IP rotation…

bua86 ★

AI-powered browser automation for Go — describe tasks in plain English, let the agent handle the clicks.

apitap86 ★

CLI, MCP server, and npm library that turns any website into an API — no docs, no SDK, no browser.

scrapegraph-py79 ★

Official Python SDK for the ScrapeGraph AI API. Smart scraping, search, crawling, markdownify, agentic…

ai-web-scraper78 ★

AI web scraper built with Crawl4AI for extracting structured leads data from websites.

pebkac-chrome78 ★

pebkac Chrome Nonautomation - A Local LLM-Driven Web Co-Browser using Smolagents, Zendriver, Trafilatura.

ketch78 ★

Fast, stateless CLI for web search and scrape. Built for AI agents.

london-property-hunt-public76 ★

Automated London flat/room hunt powered by Claude Code + Claude in Chrome + Gmail MCP. Scrapes 4 rental…

advanced-sitemap-parser76 ★

XML sitemap parser designed to extract and process millions of URLs while bypassing most modern anti-bot…

Custom-MCP-Server75 ★

MCP server for scraping LinkedIn, Facebook, Instagram profiles and Google search.

tavily-chat74 ★

Conversational agent that fuses chat data with live web results through Tavily search, extract, and crawl.

slack-gpt-bot74 ★

GPT4-powered Slack bot that can scrape URL contents

Website-Crawler74 ★

Extract data from websites in LLM ready JSON or CSV format. Crawl or Scrape entire website with Website…

OpenAver73 ★

Modern JAV metadata manager — multi-source scraping, Jellyfin integration, and AI-ready API. Built with…

WhiskeyAI73 ★

Whiskey AI lets you create autonomous AI agents without code. Connect APIs, automate Solana actions, launch…

ytfetcher72 ★

⚡ Build structured YouTube datasets at scale — effortlessly fetch transcripts and rich metadata for NLP, ML…

actor-rag-web-browser72 ★

RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text…

jobclaw70 ★

🦞 AI-powered job hunting agent — scrapes Boss直聘/LinkedIn, matches your profile, auto-applies. Built with…

camofox-mcp66 ★

Anti-detection browser MCP server for AI agents — navigate, interact, and automate the web without getting…

cortex-scout66 ★

A unified web extraction and stateful automation engine for AI. Replaces heavy testing frameworks with…

advanced-deep-research65 ★

Automated Deep Research with LLMs, web search, paper parsing, and didactic summarization.

llmnet65 ★

The Offline Internet.

reddit_karma_farmer_auto_commentator_with_AI64 ★

Reddit_Commentator_AIHawk is a Python project showcasing the power of artificial intelligence in social media…

Crawllama64 ★

CrawlLama 🦙 is a local AI agent that answers questions via Ollama and integrates web- and RAG-based…

scrapit63 ★

A (really) easy way to web scrape

bedrock-agents-webscraper59 ★

This repo provides guidance on setting up a bedrock agent to webscrape and internet search via action groups

slither59 ★

A simple, easy to use framework for adding randomized, anonymous IP addresses and user-agents to web…

eGet-Crawler-for-ai56 ★

Web scraping framework built for AI applications. Extract clean, structured content from any website with…

PythonScrapyBasicSetup56 ★

Basic setup with random user agents and IP addresses for Python Scrapy Framework.

crawlbase-mcp55 ★

Crawlbase MCP Server connects AI agents and LLMs with real-time web data. It powers Claude, Cursor, and…

supacrawler55 ★

Supacrawler's ultralight engine for scraping and crawling the web. Written in Go for maximum performance and…

mtywatch54 ★

Monitor web page content changes with a single sentence. AI | crawler | web page monitoring | page update alerts | web content subscription

AutomatiQ54 ★

A tool that watches you browse, then writes HTTP-based automation scripts

firecrawl-quickstarts54 ★

A collection of cookbooks to help developers get started quickly with the Firecrawl API.

AI-Lead-Generation-Agent49 ★

AI Lead Generation Agent that automatically discovers and qualifies potential leads from Quora. Using…

ai-lead-generator48 ★

AI-powered agent that scrapes leads with Bright Data, qualifies them using OpenAI, and delivers…

oxylabs-ai-studio-js47 ★

Structured data gathering from any website using AI-powered scraper, crawler, and browser automation…

agent-browser-workspace45 ★

Local browser toolkit for AI agents: deep research and browser use automation with local Chrome (CDP) +…

turbowebfetch45 ★

🌐 Real-browser web fetching for AI agents.

Datavizion-RAG45 ★

Retrieval-augmented generation (RAG) for remote & local LLM use

Qurio44 ★

Qurio brings multi-provider models, custom agents, reusable skills, MCP servers, HTTP tools, retrieval…

google_news_content_scrape_and_analyze_with_gpt43 ★

This demo repository illustrates how to use Python to scrape news articles from Google based on a given…

browsegenie42 ★

AI Agent which can do any kind of Browser Automation Task & Web Scraping, just using a single prompt!

osint-skill42 ★

OSINT Skill for AI agents (Claude Code, OpenClaw, Codex, OpenCode) — from a name to a scored dossier with…

chew42 ★

Chew is a Go library for processing various content types into markdown/plaintext.

llm-use40 ★

LLM orchestration toolkit for agent workflows: planner + workers + synthesis, optional router (LLM + learned …

template-browsing-agent40 ★

A powerful integration that combines Browserbase's Stagehand with Mastra for advanced web automation…

puppeteer-mcp-claude39 ★

Browser automation MCP server for Claude, powered by Puppeteer.

serp-api-comparison38 ★

Compare the best SERP APIs by pricing, latency, coverage, and use cases for SEO tools, AI agents, rank…

Reddit-AI-Agent37 ★

Reddit AI Agent is an intelligent tool that helps you explore Reddit like never before! 🔎 It allows you to…

google-maps-lead-generator36 ★

Extract Google Maps business leads and enrich contact details using AI & web scraping

google-researcher-mcp36 ★

⚠️ DEPRECATED — Use https://github.com/zoharbabin/web-researcher-mcp instead

auto-md36 ★

Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files

langchain-webscraper-demo35 ★

A chatbot demo that scrapes a website and stores the result in a vector db, which can then be queried via…

SmartTourister34 ★

We have developed a fully AI/ML-based itinerary recommendation system which when used by people coming to…

onequery34 ★

AI web agent to find answers to any question

claude-for-safari33 ★

Give your AI Agent the power to control Safari on macOS. No extensions, no separate browser.

machine-learning-resources33 ★

Collection of Project Tutorials / blogs in python, web applications, machine learning, data science, deep…

gptauto33 ★

ChatGPT selenium scraper written in Python

steel-python32 ★

The official Python library for the Steel API

ai-job-scraper32 ★

🕵️‍♂️ Privacy-focused AI job scraper, local storage, and interactive dashboard. Auto-scrapes AI/ML roles from…

scrapegraphai-ai-copilot32 ★

crawl4ai-skill31 ★

Web scraping skill for Claude AI. Crawl websites, extract structured data with CSS/LLM strategies, handle…

bricks31 ★

Bricks is the open-source, fully local alternative to Clay.com. It combines AI agents, advanced web scraping…

AgentStack31 ★

AgentStack is a production-grade multi-agent framework built on Mastra, delivering 50+ enterprise tools, 25+…

Smarter-Web-Scraping-with-Python31 ★

Leverage modern open-source tools to create better web scraping workflows.

mcp-jinaai-reader31 ★

🔍 Model Context Protocol (MCP) tool for parsing websites using the Jina.ai Reader

WEBGhosting-MCP30 ★

Intelligent stealth browser MCP server for AI agents with 30 tools, 22 anti fingerprint scripts, and LLM…

Wikipedia-Scraping-with-LLM-Agents30 ★

Scraping Wikipedia by combining LangChain's agents and tools with OpenAI's LLMs and function calling

browserclaw29 ★

The AI-native browser automation library. Snapshot + ref targeting — born from OpenClaw, built for agents, by…

OpenCometAI29 ★

Open Comet is an autonomous AI agent integrated into your Chrome browser. It enables safe, transparent, and…

LinkedIn-to-Portfolio-Site-Generator29 ★

This project is a Python script that scrapes a LinkedIn PDF, generates a customized portfolio site using…

git-repo-parser29 ★

A tool to scrape all files from a GitHub repository and turn them into a JSON or TXT file. Useful for AI and…

darkweb-forums-tracker28 ★

This is a darkweb forums tracker that monitors forum posts and sends alerts to Discord

x-scraper28 ★

A Twitter/X scraper built with Playwright for browser automation and OpenAI GPT-4 for AI-powered tweet…

web-crawling-guides27 ★

How-to guides on web crawling or scraping

GPT-3.5-ON-STEROIDS27 ★

GPT-3.5-ON-STEROIDS combines GPT with Python tools, empowering dynamic web scraping, language processing, and…

pyvigate27 ★

Pyvigate: A Python framework that combines headless browsing with LLMs that assists you in your data…

perplexity-ai-export27 ★

Grabs all your Perplexity conversations data, spits it out into a nice file folder structure and allows you…

WEB-SCRAPING-MCP27 ★

MCP Server leveraging crawl4ai for web scraping and LLM-based content extraction (Markdown, text snippets…

search-cli27 ★

Multi-provider web search CLI for AI agents — Brave, Serper, Exa, Jina, Firecrawl, Perplexity, xAI in one…

AI-web_scraper27 ★

Just mention what you want and it will extract/scrape data from the Web. Useful to create AI web…

cdpilot26 ★

Zero-dependency browser automation CLI. 70+ commands, 10 test assertions, smart commands (click/fill by text…

browser-serp26 ★

Real-time Google Search API for AI Agents & RAG pipelines. Get structured SERP data instantly using remote…

spider-clients26 ★

Python, Javascript, and Rust libraries for the Spider Cloud API.

Reddit-Content-Research-Agent26 ★

Build a Reddit Content Research Agent with LLMs, LangChain, SERP, Jupyter, Django, Bright Data, Celery…

agentql-integrations26 ★

AgentQL's integrations with workflow automation tools and AI agent frameworks let you extract structured data…

Deep-Research-using-Gemini-api26 ★

AI-powered deep research tool leveraging web scraping for cost-effective, comprehensive analysis. Open-source…

wikibot26 ★

A 🤖 which provides features from Wikipedia like summary, title searches, location API etc.

openai-scraper26 ★

This is a template repository for building a web scraper with OpenAI support. The repository provides a basic…

Agent-WebCloak26 ★

[IEEE S&P'26] WebCloak: Characterizing and Mitigating the Threats of LLM-Driven Web Agents as Intelligent…