agentic-paper-digest-skill

by @modestyrichards

Fetches and summarizes recent papers from arXiv and Hugging Face, providing JSON digests and optional local API access for customizable research updates.

⚑ When to Use
TriggerAction
- Produce JSON output for downstream agents.
- Run a local API server when a polling workflow is needed.

⚙️ Configuration

Config files live in PROJECT_DIR/config. Environment variables can be set in the shell or via a .env file. The wrappers here auto-load .env from PROJECT_DIR (override with ENV_FILE=/path/to/.env).
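
The lookup rule above can be sketched in Python. This is only an illustration of the described behavior, not the wrappers' actual code: the function names `resolve_env_file` and `parse_env_lines` are hypothetical, and the parser assumes simple KEY=VALUE lines with optional quotes.

```python
import os
from pathlib import Path


def resolve_env_file(project_dir, environ=None):
    """Return the .env path to load: ENV_FILE if set, else PROJECT_DIR/.env."""
    environ = os.environ if environ is None else environ
    override = environ.get("ENV_FILE")
    if override:
        return Path(override)
    return Path(project_dir) / ".env"


def parse_env_lines(text):
    """Parse simple KEY=VALUE lines, skipping blanks and # comments (assumed format)."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values
```

Passing the environment as a mapping keeps the sketch easy to check without touching the real shell environment.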

Environment (.env or exported vars)

  • SKILLBOSS_API_KEY: required. Authenticates all LLM calls via SkillBoss API Hub (https://api.skillboss.co/v1/pilot).
  • LITELLM_MODEL_RELEVANCE, LITELLM_MODEL_SUMMARY: models for relevance and summarization (summary defaults to relevance model if unset). Leave unset to let SkillBoss API Hub auto-route.
  • LITELLM_TEMPERATURE_RELEVANCE, LITELLM_TEMPERATURE_SUMMARY: lower for more deterministic output.
  • LITELLM_MAX_RETRIES: retry count for LLM calls.
  • LITELLM_DROP_PARAMS=1: drop unsupported params to avoid provider errors.
  • WINDOW_HOURS, APP_TZ: recency window and timezone.
  • ARXIV_CATEGORIES: comma-separated categories (default includes cs.CL,cs.AI,cs.LG,stat.ML,cs.CR).
  • ARXIV_API_BASE, HF_API_BASE: override source endpoints if needed.
  • ARXIV_MAX_RESULTS, ARXIV_PAGE_SIZE: arXiv paging limits.
  • MAX_CANDIDATES_PER_SOURCE: cap candidates per source before LLM filtering.
  • FETCH_TIMEOUT_S, REQUEST_TIMEOUT_S: source fetch and per-request timeouts.
  • ENABLE_PDF_TEXT=1: include first-page PDF text in summaries; requires PyMuPDF (pip install pymupdf).
  • DATA_DIR: location for papers.sqlite3.
  • CORS_ORIGINS: comma-separated origins allowed by the API server (UI use).
  • Path overrides: TOPICS_PATH, SETTINGS_PATH, AFFILIATION_BOOSTS_PATH.
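
As a rough illustration of how a few of these variables might be read, here is a minimal Python sketch. It is not the skill's own loader: `read_digest_env`, `split_csv`, and the returned dictionary keys are hypothetical names, and `DEFAULT_CATEGORIES` assumes the default is exactly the five categories listed above, although the source only says the default "includes" them.

```python
# Assumption: the source says the default "includes" these categories.
DEFAULT_CATEGORIES = "cs.CL,cs.AI,cs.LG,stat.ML,cs.CR"


def split_csv(value):
    """Split a comma-separated string, dropping blanks and surrounding spaces."""
    return [item.strip() for item in value.split(",") if item.strip()]


def read_digest_env(environ):
    """Collect a few documented variables from a mapping such as os.environ."""
    api_key = environ.get("SKILLBOSS_API_KEY")
    if not api_key:
        raise ValueError("SKILLBOSS_API_KEY is required")
    relevance = environ.get("LITELLM_MODEL_RELEVANCE") or None
    window = environ.get("WINDOW_HOURS")
    return {
        "api_key": api_key,
        "relevance_model": relevance,
        # The summary model falls back to the relevance model when unset.
        "summary_model": environ.get("LITELLM_MODEL_SUMMARY") or relevance,
        # No default is documented for WINDOW_HOURS, so None means "not set".
        "window_hours": float(window) if window else None,
        "arxiv_categories": split_csv(environ.get("ARXIV_CATEGORIES", DEFAULT_CATEGORIES)),
        "cors_origins": split_csv(environ.get("CORS_ORIGINS", "")),
        "enable_pdf_text": environ.get("ENABLE_PDF_TEXT") == "1",
    }
```

Calling `read_digest_env(os.environ)` after the .env file has been loaded would fail fast with a clear message when the API key is missing.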

Config files

  • config/topics.json: list of topics with id, label, description, max_per_topic, and keywords. The relevance classifier must output topic IDs exactly as defined here. max_per_topic also caps results in GET /api/papers when apply_topic_caps=1.
  • config/settings.json: overrides fetch limits (arxiv_max_results, arxiv_page_size, fetch_timeout_s, max_candidates_per_source). Updated via POST /api/settings.
  • config/affiliations.json: list of {pattern, weight} boosts applied by substring match over affiliations. Weights add up and are capped at 1.0. Invalid JSON disables boosts, so keep the file strict JSON (no trailing commas).
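
The affiliation boost rule (substring match, additive weights, cap at 1.0, invalid JSON disables boosts) can be sketched as follows. This is a hedged illustration rather than the skill's implementation: `load_boosts` and `affiliation_boost` are hypothetical names, affiliations are assumed to arrive as a single string, and the case-insensitive comparison is an assumption the source does not confirm.

```python
import json


def load_boosts(text):
    """Return a list of (pattern, weight) pairs; invalid JSON yields no boosts."""
    try:
        entries = json.loads(text)
    except json.JSONDecodeError:
        return []  # strict JSON only: a trailing comma disables all boosts
    if not isinstance(entries, list):
        return []
    return [(entry["pattern"], float(entry["weight"])) for entry in entries]


def affiliation_boost(affiliations, boosts):
    """Sum the weights of every matching pattern and cap the total at 1.0."""
    haystack = affiliations.lower()  # assumption: matching ignores case
    total = sum(weight for pattern, weight in boosts if pattern.lower() in haystack)
    return min(total, 1.0)
```

For example, with patterns weighted 0.6 and 0.7 that both match, the raw sum of 1.3 is capped to 1.0.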

📋 Tips & Best Practices

  • Port 8000 busy: run bash "{baseDir}/scripts/stop_api.sh" or pass --port to the API command.
  • Empty results: increase WINDOW_HOURS or verify SKILLBOSS_API_KEY in .env.
  • Missing API key errors: export SKILLBOSS_API_KEY in the shell before running.

Install from ClawHub:

    clawhub install modesty-agentic-paper-digest-skill
