🎁 Get the FREE AI Skills Starter Guide β€” Subscribe β†’
BytesAgainBytesAgain
πŸ¦€ ClawHub✦ BytesAgain

Ml Visualizer

by @bytesagain-lab

Visual analysis and diagnostic tools to help machine learning model selection. ml-visualizer, python, anaconda, estimator, machine-learning, matplotlib.

Versionv1.0.0
Downloads544
Installs1
TERMINAL
clawhub install ml-visualizer

πŸ“– About This Skill


version: "1.0.0" name: Yellowbrick description: "Visual analysis and diagnostic tools to help machine learning model selection. ml-visualizer, python, anaconda, estimator, machine-learning, matplotlib."

ML Visualizer

A data toolkit for ingesting, transforming, querying, and visualizing machine learning datasets. Manage your entire data pipeline β€” from raw ingestion through profiling and validation β€” all from the command line.

Commands

| Command | Description | |---------|-------------| | ml-visualizer ingest | Ingest raw data or record a data source entry | | ml-visualizer transform | Log a data transformation step or operation | | ml-visualizer query | Record a query against your dataset | | ml-visualizer filter | Log a filter operation applied to data | | ml-visualizer aggregate | Record an aggregation or rollup operation | | ml-visualizer visualize | Log a visualization request or chart specification | | ml-visualizer export | Record an export operation or export all data | | ml-visualizer sample | Log a data sampling operation | | ml-visualizer schema | Record or describe a data schema | | ml-visualizer validate | Log a data validation check | | ml-visualizer pipeline | Record a full pipeline definition or step | | ml-visualizer profile | Log a data profiling run | | ml-visualizer stats | Show summary statistics across all entry types | | ml-visualizer export | Export all data (formats: json, csv, txt) | | ml-visualizer search | Search across all entries by keyword | | ml-visualizer recent | Show the 20 most recent activity log entries | | ml-visualizer status | Health check β€” version, disk usage, last activity | | ml-visualizer help | Show the built-in help message | | ml-visualizer version | Print the current version (v2.0.0) |

Each data command (ingest, transform, query, etc.) works in two modes:

  • Without arguments β€” displays the 20 most recent entries of that type
  • With arguments β€” saves the input as a new timestamped entry
  • Data Storage

    All data is stored as plain-text log files in ~/.local/share/ml-visualizer/:

  • Each command type gets its own log file (e.g., ingest.log, transform.log, visualize.log)
  • Entries are stored in timestamp|value format for easy parsing
  • A unified history.log tracks all activity across command types
  • Export to JSON, CSV, or TXT at any time with the export command
  • Set the ML_VISUALIZER_DIR environment variable to override the default data directory.

    Requirements

  • Bash 4.0+ (uses set -euo pipefail)
  • Standard Unix utilities: date, wc, du, tail, grep, sed, cat
  • No external dependencies or API keys required
  • When to Use

    1. Building a data pipeline journal β€” use ingest, transform, and pipeline to document each step of your ML data preparation workflow 2. Tracking data quality β€” use validate and profile to log validation checks and profiling runs, ensuring data integrity before model training 3. Logging visualization requests β€” use visualize to record what charts and plots you've generated for model diagnostics (confusion matrices, ROC curves, feature importance) 4. Managing dataset schemas β€” use schema to document the structure of your datasets, track schema changes over time, and share definitions with your team 5. Auditing data operations β€” use search, recent, and stats to review your complete data processing history and find specific operations

    Examples

    # Ingest a new data source
    ml-visualizer ingest "Loaded training set from s3://ml-data/train.csv β€” 50,000 rows, 24 features"

    Record a transformation step

    ml-visualizer transform "Applied StandardScaler to numeric columns, one-hot encoded categoricals"

    Log a visualization

    ml-visualizer visualize "Generated confusion matrix for RandomForest classifier β€” 94% accuracy"

    Define a schema entry

    ml-visualizer schema "users table: id(int), age(int), income(float), segment(str), churn(bool)"

    Search past operations

    ml-visualizer search "StandardScaler"

    Output

    All commands print results to stdout. Redirect to a file if needed:

    ml-visualizer stats > pipeline-report.txt
    ml-visualizer export json
    


    Powered by BytesAgain | bytesagain.com | hello@bytesagain.com

    ⚑ When to Use

    TriggerAction
    2. **Tracking data quality** β€” use `validate` and `profile` to log validation checks and profiling runs, ensuring data integrity before model training
    3. **Logging visualization requests** β€” use `visualize` to record what charts and plots you've generated for model diagnostics (confusion matrices, ROC curves, feature importance)
    4. **Managing dataset schemas** β€” use `schema` to document the structure of your datasets, track schema changes over time, and share definitions with your team
    5. **Auditing data operations** β€” use `search`, `recent`, and `stats` to review your complete data processing history and find specific operations

    πŸ’‘ Examples

    # Ingest a new data source
    ml-visualizer ingest "Loaded training set from s3://ml-data/train.csv β€” 50,000 rows, 24 features"

    Record a transformation step

    ml-visualizer transform "Applied StandardScaler to numeric columns, one-hot encoded categoricals"

    Log a visualization

    ml-visualizer visualize "Generated confusion matrix for RandomForest classifier β€” 94% accuracy"

    Define a schema entry

    ml-visualizer schema "users table: id(int), age(int), income(float), segment(str), churn(bool)"

    Search past operations

    ml-visualizer search "StandardScaler"