Ml Visualizer
by @bytesagain-lab
Visual analysis and diagnostic tools to help machine learning model selection. ml-visualizer, python, anaconda, estimator, machine-learning, matplotlib.
clawhub install ml-visualizerπ About This Skill
version: "1.0.0" name: Yellowbrick description: "Visual analysis and diagnostic tools to help machine learning model selection. ml-visualizer, python, anaconda, estimator, machine-learning, matplotlib."
ML Visualizer
A data toolkit for ingesting, transforming, querying, and visualizing machine learning datasets. Manage your entire data pipeline β from raw ingestion through profiling and validation β all from the command line.
Commands
| Command | Description |
|---------|-------------|
| ml-visualizer ingest | Ingest raw data or record a data source entry |
| ml-visualizer transform | Log a data transformation step or operation |
| ml-visualizer query | Record a query against your dataset |
| ml-visualizer filter | Log a filter operation applied to data |
| ml-visualizer aggregate | Record an aggregation or rollup operation |
| ml-visualizer visualize | Log a visualization request or chart specification |
| ml-visualizer export | Record an export operation or export all data |
| ml-visualizer sample | Log a data sampling operation |
| ml-visualizer schema | Record or describe a data schema |
| ml-visualizer validate | Log a data validation check |
| ml-visualizer pipeline | Record a full pipeline definition or step |
| ml-visualizer profile | Log a data profiling run |
| ml-visualizer stats | Show summary statistics across all entry types |
| ml-visualizer export | Export all data (formats: json, csv, txt) |
| ml-visualizer search | Search across all entries by keyword |
| ml-visualizer recent | Show the 20 most recent activity log entries |
| ml-visualizer status | Health check β version, disk usage, last activity |
| ml-visualizer help | Show the built-in help message |
| ml-visualizer version | Print the current version (v2.0.0) |
Each data command (ingest, transform, query, etc.) works in two modes:
Data Storage
All data is stored as plain-text log files in ~/.local/share/ml-visualizer/:
ingest.log, transform.log, visualize.log)timestamp|value format for easy parsinghistory.log tracks all activity across command typesexport commandSet the ML_VISUALIZER_DIR environment variable to override the default data directory.
Requirements
set -euo pipefail)date, wc, du, tail, grep, sed, catWhen to Use
1. Building a data pipeline journal β use ingest, transform, and pipeline to document each step of your ML data preparation workflow
2. Tracking data quality β use validate and profile to log validation checks and profiling runs, ensuring data integrity before model training
3. Logging visualization requests β use visualize to record what charts and plots you've generated for model diagnostics (confusion matrices, ROC curves, feature importance)
4. Managing dataset schemas β use schema to document the structure of your datasets, track schema changes over time, and share definitions with your team
5. Auditing data operations β use search, recent, and stats to review your complete data processing history and find specific operations
Examples
# Ingest a new data source
ml-visualizer ingest "Loaded training set from s3://ml-data/train.csv β 50,000 rows, 24 features"Record a transformation step
ml-visualizer transform "Applied StandardScaler to numeric columns, one-hot encoded categoricals"Log a visualization
ml-visualizer visualize "Generated confusion matrix for RandomForest classifier β 94% accuracy"Define a schema entry
ml-visualizer schema "users table: id(int), age(int), income(float), segment(str), churn(bool)"Search past operations
ml-visualizer search "StandardScaler"
Output
All commands print results to stdout. Redirect to a file if needed:
ml-visualizer stats > pipeline-report.txt
ml-visualizer export json
Powered by BytesAgain | bytesagain.com | hello@bytesagain.com
β‘ When to Use
π‘ Examples
# Ingest a new data source
ml-visualizer ingest "Loaded training set from s3://ml-data/train.csv β 50,000 rows, 24 features"Record a transformation step
ml-visualizer transform "Applied StandardScaler to numeric columns, one-hot encoded categoricals"Log a visualization
ml-visualizer visualize "Generated confusion matrix for RandomForest classifier β 94% accuracy"Define a schema entry
ml-visualizer schema "users table: id(int), age(int), income(float), segment(str), churn(bool)"Search past operations
ml-visualizer search "StandardScaler"