BytesAgain

All Skills

210 skills total matching "extraction"

🦀 ClawHub16.7k dl
Pdf Extract
Extract text from PDF files for LLM processing
GitHub35.5k
x-twitter-scraper
X (Twitter) data platform skill — tweet search, user lookup, follower extraction, engagement metrics, giveaway draws, monitoring, webhooks, 19 extraction tools, MCP server.
🦀 ClawHub6.1k dl
Jina Reader
Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clea...
GitHub35.1k
security-requirement-extraction
Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
🦀 ClawHub5.6k dl
Image Vision
Analyze and interpret images by describing content, extracting text, answering questions, comparing visuals, and extracting structured data from JPG, PNG, GI...
🦀 ClawHub4.8k dl
Generate Instagram marketing content from product URLs. Extract product information and create engaging Instagram posts with image suggestions, captions, and hashtags optimized for engagement. Use when user provides a product URL from e-commerce sites like Amazon, Shopify, Taobao, etc. and wants Instagram marketing content such as Image/text post ideas, engaging captions with CTAs, hashtag strategy, or Story/reels content suggestions.
GitHub35.1k
bdistill-knowledge-extraction
Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
🦀 ClawHub4.0k dl
Agent Browser
Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.
🦀 ClawHub2.9k dl
Crawl4ai
AI-powered web scraping framework for extracting structured data from websites. Use when Codex needs to crawl, scrape, or extract data from web pages using AI-powered parsing, handle dynamic content, or work with complex HTML structures.
🦀 ClawHub2.3k dl
Google Maps Search Api
This skill is designed to help users automatically extract business data from Google Maps search results. When a user asks to "find coffee shops in New York," "search for dental clinics," or "extract business leads from Google Maps," the agent should proactively apply this skill.
🦀 ClawHub2.3k dl
Parallel Extract
URL content extraction via Parallel API. Extracts clean markdown from webpages, articles, PDFs, and JS-heavy sites. Use for reading specific URLs with LLM-re...
GitHub25.1k
x-twitter-scraper
Use when the user wants to integrate with the X (Twitter) API via Xquik to search tweets, look up user profiles, extract followers, run giveaway draws, monitor accounts, or access trending topics. Also use when the user mentions 'Xquik,' 'Twitter API,' 'X API,' 'tweet scraper,' 'follower extraction,' or 'Twitter monitoring.' Covers REST API, webhooks, and MCP server setup.
🦀 ClawHub2.2k dl
xiaohongshu-extract
Extract metadata from Xiaohongshu (XHS) share or discovery URLs by parsing window.__INITIAL_STATE__ and returning note details. Use when asked to fetch XHS page content, note metadata, video info, or engagement stats from a public XHS link.
GitHub10.0k
wordpress-theme-to-emdash
Port WordPress themes to EmDash CMS. Use when asked to convert, migrate, or port a WordPress theme to EmDash, or when creating an EmDash site that should match an existing WordPress site's design. Handles design extraction, template conversion, and EmDash-specific features like menus, taxonomies, and widgets.
🦀 ClawHub1.9k dl
Meta Video Ad Analyzer
Extract and analyze content from video ads using Gemini Vision AI. Supports frame extraction, OCR text detection, audio transcription, and AI-powered scene analysis. Use when analyzing video creative content, extracting text overlays, or generating scene-by-scene descriptions.
GitHub8.2k
affiliate-marketing
Scrape Amazon product details via browser and generate platform-optimized promotional content (Twitter/X, Instagram, WeChat) using LLM. No third-party API needed — browser-based extraction only.
🦀 ClawHub1.7k dl
Figma Sync
Read Figma files, extract design tokens, generate React Native Expo TS or Web React + Tailwind code, write back to Figma, and diff local models against Figma for minimal patches. Triggers: "pull figma", "sync figma", "figma to code", "push to figma", "diff figma", "extract design tokens", "generate from figma", "preview figma changes"
🦀 ClawHub1.5k dl
Terabox Link Extractor
Direct link extraction from TeraBox URLs using the XAPIverse protocol. Extracts high-speed download and stream links (360p/480p) without browser session requ...
🦀 ClawHub1.4k dl
Email to Calendar Extraction Engine
Extract calendar events, deadlines, action items, and follow-ups from emails. Works with any calendar provider (Google, Outlook, Apple, Notion, etc.). No external dependencies — pure agent intelligence. Use when the user forwards an email, asks to check inbox for events, or wants to extract structured scheduling data from any text.
🦀 ClawHub1.3k dl
deso-research
Research and analyze content across decentralized social networks (Farcaster, Lens, Nostr, Bluesky) using the deso-ag CLI tool. Use this skill when users want to research topics on decentralized social platforms, analyze trending content, extract discussion terms, browse Farcaster channels, or compare engagement across networks. Trigger on phrases like "research X on Farcaster", "what's trending on Lens", "analyze [topic] across deso networks", "search deso for [topic]", "extract trending terms"
🦀 ClawHub1.2k dl
X Actionbook Recap
Collect, scroll, extract, and summarize recent X (Twitter) posts for any handle (optionally filtered by keyword search) using the Actionbook Rust CLI (actionbook-rs) workflow (open → snapshot/accessibility tree → extract `article` text). Use when asked to analyze a handle over a time window (e.g., last 7 days), produce Chinese working notes, and publish a neutral English recap (single post or thread) from a specified account.
🦀 ClawHub1.2k dl
X Extract
Extract tweet content from x.com URLs without credentials using browser automation. Use when user asks to "extract tweet", "download x.com link", "get tweet...
🦀 ClawHub1.1k dl
pdf-skill
Create, read, edit, merge, split PDF files. Supports text extraction, table extraction, form filling, watermarks, OCR, and HTML-to-PDF conversion.
🦀 ClawHub986 dl
GLM-OCR
Extract text from images using GLM-OCR API. Supports images and PDFs with high accuracy OCR, table recognition, formula extraction, and handwriting recogniti...
🦀 ClawHub908 dl
Summarize Pdf
PDF to Markdown converter - extract text, tables and formulas from PDF files to clean Markdown. Use when converting PDF documents, extracting PDF content, pa...
🦀 ClawHub883 dl
Data Spider
Scrape any webpage and extract structured data as JSON, table, or list. Supports schema-guided extraction.
🦀 ClawHub784 dl
Local GLM OCR with llama.cpp on AIPC (no API Key)
Image OCR, text recognition, extract text from image, scan document, read image text, invoice OCR, receipt OCR, contract recognition, table extraction, busin...
🦀 ClawHub751 dl
Mistral PDF OCR
Extracts text, tables, and images from PDFs (including scanned PDFs) using the Mistral OCR API. Use when user asks to OCR a PDF/image, extract text from a PD...
🦀 ClawHub673 dl
BEE Belief Extraction Engine
Install and configure BEE — the Belief Extraction Engine for OpenClaw. Gives agents persistent structured memory across sessions. Auto-extracts beliefs at se...
🦀 ClawHub638 dl
DocStream
Document processing via DocStream API — text extraction, summarization, format conversion, PDF parsing. Use when user needs to extract text from documents, s...
🦀 ClawHub606 dl
Tavily Extract
Extract content from specific URLs using Tavily's extraction API. Returns clean markdown/text from web pages.
🦀 ClawHub582 dl
PDF Extract
Extract PDF extracts structured data from PDFs and images, including tables, OCR text, images, and stamps, built on ComPDF data extraction and AI document ex...
🦀 ClawHub561 dl
Supermemory
Long-term agent memory with atomic fact extraction, relational versioning, semantic search, and entity profiles. Extracts facts from conversations, tracks ho...
🦀 ClawHub536 dl
Discovery Engine
Cross-domain scientific discovery through structured extraction of scientific publications. What one paper solves, another needs — this skill extracts provid...
🦀 ClawHub504 dl
Scan To Markdown
OCR document extraction - extract text from scanned documents, photos, and images using OCR. Use when reading scanned PDFs, photographed pages, handwritten n...
🦀 ClawHub482 dl
mineru precision extract PDF, Document, Images
MinerU precision extract — high-accuracy document extraction with full feature set. Convert PDFs, scanned documents, images, Word (DOC/DOCX), PowerPoint (PPT...
🦀 ClawHub466 dl
Grab Videos and Frames from Youtube and other sites
Download videos and extract frames using yt-dlp and ffmpeg. Use when: - Download YouTube videos/Shorts: `yt-dlp <url>` - Extract frames from video: `ffmpeg -...
🦀 ClawHub445 dl
pdf-extract-skill
OpenClaw PDF extraction skill using OpenDataLoader. Use when the user wants to extract and process PDF content for RAG, embeddings, or coordinate-based citat...
🦀 ClawHub440 dl
F.A.C.E.T. Framework (Mental Models: the F.A.C.E.T. Framework)
F.A.C.E.T. cognitive framework for extracting mental models from classic books and theories. Use when (1) user asks for "reading notes", "extract models", or...
🦀 ClawHub431 dl
Pdf2md
PDF to Markdown converter - extract text, tables and formulas from PDF files to clean Markdown. Use when converting PDF documents, extracting PDF content, pa...
🦀 ClawHub421 dl
ObjectRemover Video Object Ops
Remove or extract objects from videos with AI. Supports watermark/distraction removal, subject extraction with transparent background, natural-language objec...
🦀 ClawHub414 dl
youtube-research-kit
Extract and analyze YouTube video content using yt-dlp. Supports metadata extraction, transcript/subtitle download, comment retrieval, playlist analysis, and...
🦀 ClawHub407 dl
Extract Design
Use this skill when the user wants to extract a webpage's design language into a reusable HTML style reference file, including typography, colors, spacing, s...
🦀 ClawHub396 dl
tra-extract-text
Extract readable text, markdown, HTML, JSON, or XML content from web pages using the trafilatura CLI tool with optional metadata and output formatting.