🎁 Get the FREE AI Skills Starter GuideSubscribe →
BytesAgainBytesAgain

All Skills

104 skills total matching "image processing"

🦀 ClawHub41.4k dl
Markdown Converter
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.
GitHub167.2k
nutrient-document-processing
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
🦀 ClawHub3.4k dl
Screenshot Capture
Process screenshots Enzo shares with comments. Save to reference library, extract content, categorize, set reminders, and log patterns. Use when Enzo sends an image with context like "save this", shares a screenshot of content (LinkedIn posts, tweets, articles), or sends ideas/frameworks to remember.
GitHub167.2k
nutrient-document-processing
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
🦀 ClawHub1.9k dl
Remarkable
Fetch handwritten notes, sketches, and drawings from a reMarkable tablet via Cloud API (rmapi). Process content by refining artwork with AI image generation, extracting handwritten text to memory/journal, or using sketches as input for other workflows. Use when working with reMarkable tablet content, syncing handwritten notes, processing sketches, or integrating tablet drawings into projects.
GitHub167.2k
nutrient-document-processing
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
🦀 ClawHub1.9k dl
file-processor
Automatically detects and processes files including PDF, Excel, CSV, Word, images, and text for extraction, OCR, data analysis, and summarization.
GitHub39.7k
career-ops
AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.
🦀 ClawHub1.6k dl
Grok Imagine Image Pro
Generates and edits high-quality PNG images via xAI Grok/Flux API using prompts, styles, aspect ratios, and batch processing with base64 output.
GitHub35.1k
threejs-postprocessing
Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
🦀 ClawHub1.5k dl
Dlazy Imageseg
Image matting tool: separates foreground from background and returns transparent background URL, suitable for product image processing, character cutout, and...
GitHub35.1k
threejs-postprocessing
Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
🦀 ClawHub1.1k dl
removebg-api
Remove image backgrounds using the remove.bg API with API-key auth and transparent PNG output. Use when high-quality cutouts are needed and cloud processing...
GitHub35.1k
threejs-postprocessing
Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.
🦀 ClawHub916 dl
multimodal-parser
Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM processing
GitHub34.9k
pubmed-database
Direct REST API access to PubMed. Advanced Boolean/MeSH queries, E-utilities API, batch processing, citation management. For Python workflows, prefer biopython (Bio.Entrez). Use this for direct HTTP/REST work or custom API implementations.
🦀 ClawHub736 dl
Remove Watermark
Remove light-colored text watermarks from white-background document images (exam papers, scanned documents). No API key needed - pure local image processing....
GitHub19.6k
elevenlabs
ElevenLabs audio generation — text-to-speech, voice cloning, and sound effects. Use this skill any time the agent needs to: convert text to spoken audio, narrate documents or content, generate voiceovers, clone voices from audio samples, create sound effects, or produce any audio output from text. Supports multiple voices, languages, models, voice cloning, batch processing, and sound effect generation. Requires ELEVENLABS_API_KEY.
🦀 ClawHub626 dl
File Batch Processor
One-click batch processing for all files: rename, compress images, convert to PDF, auto organize. No software installation needed, runs locally, safe and ad-...
GitHub5.7k
processing-stix-taxii-feeds
754 structured cybersecurity skills for AI agents · Mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND & NIST AI RMF · agentskills.io standard · Works with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI & 20+ platforms · 26 security domains · Apache 2.0
🦀 ClawHub615 dl
When dealing with text within an image, the system automatically recognizes it as an OCR (Optical Character Recognition) task and applies the corresponding capabilities.
OCR (Optical Character Recognition) tool using Tesseract for extracting text from images. Use when: (1) processing screenshots, charts, or documents in image...
🦀 ClawHub562 dl
fal.ai
fal.ai API integration with managed API key authentication. Run AI models for image generation, video generation, audio processing, and more. Use this skill...
🦀 ClawHub495 dl
Dicom Segmentation Api
Deploy and manage medical image segmentation using TotalSegmentator and MONAI with DICOM upload, batch processing, 3D export, and statistics generation.
🦀 ClawHub473 dl
飞书文件发送技能(安全版)
Send files, images, and audio messages via Feishu Lark API using the mandatory two-step process. Use when needing to send files, images, or voice messages to...
🦀 ClawHub437 dl
keevx-image-to-video
Convert images to videos using Keevx API with support for multiple models, resolutions up to 4K, audio generation, and batch processing.
🦀 ClawHub392 dl
OpenCV
Computer vision and image processing using OpenCV WebAssembly. Uses opencv-component.wasm running in openclaw-wasm-sandbox plugin. Supports image processing,...
🦀 ClawHub371 dl
Image OCR Parse
Extract text from images via the PDFAPIHub cloud OCR API. Images are uploaded to pdfapihub.com for Tesseract OCR processing. Supports preprocessing (grayscal...
🦀 ClawHub350 dl
Pixel Art Processing
Pixel art sprite sheet processing tool — video frame extraction, GIF/frames conversion, sprite sheet compose/split, image matting, pixelation, resize, crop,...
🦀 ClawHub343 dl
Microsoft MarkItDown
Use MarkItDown to convert various files (PDF, Word, Excel, PPT, images, audio, HTML, CSV, JSON, etc.) to Markdown format for LLM processing and text analysis...
🦀 ClawHub313 dl
Ecdysales
Quick product image processing: add price sticker + watermark + logo. Use when user sends `$price:` with an image. Minimal context, runs fast.
🦀 ClawHub301 dl
Afm Image Analysis 1.0.0
Analyze AFM images to compute surface roughness, detect nanoparticles, extract line profiles, generate 3D renderings, and process batches with detailed reports.
🦀 ClawHub291 dl
GenVR Skills
Generate images, videos, and process media using the GenVR API. Standalone Node.js CLI.
🦀 ClawHub273 dl
imageReader
Reads and analyzes images from messages across 10+ chat platforms using platform-specific APIs and unified image processing.
🦀 ClawHub248 dl
PDF Batch Processing Tool
Batch process PDF files - merge multiple PDFs, split PDF into multiple files, rotate pages, extract text, extract images, compress PDFs. Use when you need to...
🦀 ClawHub221 dl
Byted Tos Image Process
Provides image processing capabilities for objects in Bytedance TOS using the official SDK. Supports getting image info, format conversion, resizing, and wat...
🦀 ClawHub167 dl
watermark-remover-skill
Use this skill when the user wants to remove watermarks from images, batch-process images for watermark removal, or asks about the "布衣去水印" / "图片去水印" tool. Th...
🦀 ClawHub73 dl
Alt Text Batch
Batch-process multiple images to generate AI-powered alt text descriptions for accessibility. Supports up to 500 images per run.
🦀 ClawHub
Aihubmix Image (Dify)
**Author:** AIHubMix **Version:** 0.0.1 **Type:** Dify Plugin The AIHubMix Image Generation Plugin provides access to a variety of advanced image generation and image editing models through a unified AIHubMix API. It supports synchronous and asynchronous workflows, multiple resolutions, batch generation, image-to-image processing, and multilingual prompts.
🦀 ClawHub
Imsg Media
Fetch iMessage/Messages.app attachments (voice memos and images) and process them — transcribe audio via Silicon Flow ASR (SenseVoiceSmall), and analyze imag...
🦀 ClawHub
Kinho
Simple API for Neural Network. Better for image processing with CPU/GPU + Transfer Learning.
🦀 ClawHub
linxule/mineru-mcp
📇 ☁️ - MCP server for MinerU document parsing API. Parse PDFs, images, DOCX, and PPTX with OCR (109 languages), batch processing (200 docs), page ranges, and local file upload. 73% token reduction with structured output.
🦀 ClawHub
Novitaai (Dify)
**Novita AI** is an innovative tool for image generation and model exploration. In **Dify**, Novita AI allows you to create stunning visuals, explore model details, and generate seamless tile patterns for various design needs. With advanced AI capabilities, Novita AI simplifies creative processes and enhances workflows. To start using **Novita AI**, follow these steps: 1. **Install the Novita AI T
🦀 ClawHub18.4k dl
Image
Create, inspect, process, and optimize image files and visual assets with reliable format choice, resizing, compression, color-profile, metadata, and platfor...
🦀 ClawHub4.4k dl
Vision
Resize, crop, convert, and optimize images using ImageMagick. Use when processing photos, converting formats (PNG/WebP), compressing size, or adding watermarks.
🦀 ClawHub2.5k dl
Image Process
Image processing tool for compression, background removal/replacement, and upscaling. Invoke when user wants to compress image, remove background, change bac...
🦀 ClawHub2.2k dl
Glasses to Social
Turn smart glasses photos into social media posts. Monitors a Google Drive folder for new images from Meta Ray-Ban glasses (or any smart glasses), analyzes them with vision AI, drafts tweets/posts in the user's voice, and publishes on approval. Use when setting up a glasses-to-social pipeline, processing smart glasses photos for social media, or creating hands-free content workflows.
🦀 ClawHub1.9k dl
BitSoul AI Face Beauty 人像AI美颜
Edit image to beautify faces or portaits in it. Use when (1) User requests to process an image, (2) User asks to beautify a photo.
🦀 ClawHub1.9k dl
BitSoul AI Face Beauty 人像AI美颜
Edit image to beautify faces or portaits in it. Use when (1) User requests to process an image, (2) User asks to beautify a photo.