BytesAgainBytesAgain

Find the Right AI Skill for Any Job

Browse 21+ curated AI agent skills. Search by use case, filter by category, get the right tool instantly.

Browse by Use Case β†’Pick My Role

All Skills β€” coding

21 skills in "coding" matching "extraction"

πŸ¦€ ClawHub
Fabric Bridge
Run Fabric AI patterns for text transformation, analysis, and content creation. Use when the user asks to use a Fabric pattern, extract wisdom, analyze claims, improve writing, summarize with Fabric, or mentions 'fabric' CLI. Supports 242+ patterns for tasks like content analysis, writing improvement, code review, threat modeling, and structured extraction.
⭐ GitHub
kreuzberg
High-performance document extraction library with a Rust core, supporting 62+ formats including PDF, Office, images with OCR, HTML, email, and archives.
⭐ GitHub
pdf_oxide
A fast PDF library for text extraction, image extraction, and markdown conversion, powered by Rust.
πŸ¦€ ClawHub
Ai Agent Tools
Python library offering file handling, text extraction, data conversion, utilities, memory storage, and validation tools for AI agent workflows.
πŸ¦€ ClawHub
Uplo Github
AI-powered GitHub knowledge management. Search repository metadata, code review standards, issue tracking, and team workflows with structured extraction.
πŸ¦€ ClawHub
baml-codegen
Use when generating BAML code for type-safe LLM extraction, classification, RAG, or agent workflows - creates complete .baml files with types, functions, clients, tests, and framework integrations from natural language requirements. Queries official BoundaryML repositories via MCP for real-time patterns. Supports multimodal inputs (images, audio), Python/TypeScript/Ruby/Go, 10+ frameworks, 50-70% token optimization, 95%+ compilation success.
πŸ¦€ ClawHub
Scrapling
Web scraping and data extraction using the Python Scrapling library. Use to scrape static HTML pages, JavaScript-rendered pages (Playwright), and anti-bot or...
⭐ GitHub
MALLET
A Java-based package for statistical natural language processing, document classification, clustering, topic modelling, information extraction, and other machine learning applications to text.
πŸ¦€ ClawHub
Fast Browser Use 1.0.5
Rust-based Chrome automation for ultra-fast, token-efficient DOM extraction, session management, screenshots, infinite scroll harvesting, and sitemap analysis.
πŸ¦€ ClawHub
Web Scraper
Web scraping skill with JavaScript rendering support. Extract data from websites using CSS selectors, XPath, or AI-powered extraction.
πŸ¦€ ClawHub
Agentic Browser 0.1.2
Browser automation for AI agents via inference.sh. Navigate web pages, interact with elements using @e refs, take screenshots, record video. Capabilities: web scraping, form filling, clicking, typing, drag-drop, file upload, JavaScript execution. Use for: web automation, data extraction, testing, agent browsing, research. Triggers: browser, web automation, scrape, navigate, click, fill form, screenshot, browse web, playwright, headless browser, web agent, surf internet, record video
πŸ¦€ ClawHub
Clawbrowser
Use when the agent needs to drive a browser through the Microsoft Playwright CLI (`playwright-cli`) for navigation, form interactions, screenshots, recordings, data extraction, session management, or debugging without loading a full MCP browser. It trains the agent on the CLI commands, snapshots, and session/config habits that make Playwright CLI reliable for scripted browsing.
πŸ¦€ ClawHub
Fast Browser Use Local
Rust-based browser automation using local Chrome for ultra-fast DOM extraction, session management, screenshots, scraping, and site structure analysis.
πŸ¦€ ClawHub
StartClaw-Optimizer
Master optimization system - APPLIES TO EVERY RESPONSE. Before responding, classify task complexity (simple question vs analysis vs coding). Use Haiku for simple/navigation/extraction/status. Use Sonnet ONLY for writing/analysis/planning/debugging. Monitor context size - if >50k tokens, recommend /compact. For automations, use scheduler wrapper. Never load full conversation history for simple tasks. Heartbeats always Haiku, single-line only. Never use Opus. This skill MUST run before every respo
πŸ¦€ ClawHub
Clawhub Publish 146156
Automate web navigation, interaction, and data extraction using a fast Rust-based headless browser CLI with Node.js fallback and structured commands.
πŸ”Œ MCP
jae-jae/fetcher-mcp
πŸ“‡ 🏠 - MCP server for fetching web page content using Playwright headless browser, supporting Javascript rendering and intelligent content extraction, and outputting Markdown or HTML format.
πŸ¦€ ClawHub
Hemp CBD Video β€” Product Education and Brand Building Videos for Hemp and CBD Companies
Creates educational videos for hemp and CBD brands to explain products, dosing, lab testing, extraction methods, and farm-to-bottle stories to build trust.
πŸ¦€ ClawHub
Clawhub Publish 146198
Automate web browsing tasks like navigation, form filling, clicking, and data extraction using a fast Rust-based headless browser with Node.js fallback.
⭐ GitHub
Local Deep Research
AI-powered deep research tool with multi-source search (arXiv, PubMed, web), PDF text extraction, and encrypted local storage. `MIT` `Docker/Python`
πŸ¦€ ClawHub
System Commander
Convert user tasks to optimal Linux/Python commands. Use when user needs file processing, data extraction, text manipulation, or any task that can be solved...
πŸ¦€ ClawHub
hawk-memory-v2
Pure Python memory management with four-layer decay, context compression, extraction, vector retrieval, and self-improving features for AI agents without ext...