From the command line, quickly explore data from a CSV file.
shallow-explore is a command-line tool written in Go for iterating over the columns of a CSV file. It is a gut-check tool for confirming that your assumptions about the data hold and that the values fall within the expected range.
After installation, run the following command to start analyzing data:
shallow-explore -csv ~/complete/path/to/file/sample.csv
Note: The complete path of the file is required to load the data into the program.
shallow-explore supports three types of data: integers, floats, and strings.
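As a rough illustration of how a column could be sorted into one of these three types, here is a minimal sketch in Go. It is not the tool's actual implementation: `inferColumnType` is a hypothetical helper, and skipping empty cells is an assumption made for the example.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// inferColumnType is a hypothetical helper, not part of shallow-explore.
// It classifies a column of raw CSV cells as "integer", "float", or "string".
func inferColumnType(values []string) string {
	isInt, isFloat, seen := true, true, false
	for _, v := range values {
		v = strings.TrimSpace(v)
		if v == "" {
			continue // assumption: empty cells do not affect the type
		}
		seen = true
		if _, err := strconv.ParseInt(v, 10, 64); err != nil {
			isInt = false
		}
		if _, err := strconv.ParseFloat(v, 64); err != nil {
			isFloat = false
		}
	}
	switch {
	case !seen:
		return "string" // nothing to parse, fall back to string
	case isInt:
		return "integer"
	case isFloat:
		return "float"
	default:
		return "string"
	}
}

func main() {
	fmt.Println(inferColumnType([]string{"1", "2", "3"}))
	fmt.Println(inferColumnType([]string{"1", "2.5", "3"}))
	fmt.Println(inferColumnType([]string{"1", "n/a", "3"}))
}
```

A column is treated as a string column as soon as one non-empty cell fails to parse as a number, which is the most conservative choice for a quick check.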
For an integer or float column, the output shows the column name at the top of the frame, followed by a summary line graph of the items and some quick statistics about the data.
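The exact statistics the tool prints are not listed here. As a sketch of what a quick numeric summary could look like, the Go snippet below computes count, min, max, mean, and median. The `summary` type, the `summarize` function, and the choice of statistics are all assumptions for illustration.

```go
package main

import (
	"fmt"
	"sort"
)

// summary holds a few basic statistics for a numeric column.
// The fields are illustrative, not shallow-explore's actual output.
type summary struct {
	Count                  int
	Min, Max, Mean, Median float64
}

// summarize is a hypothetical helper. It returns false for an empty column.
func summarize(values []float64) (summary, bool) {
	if len(values) == 0 {
		return summary{}, false
	}
	// Sort a copy so the caller's slice is left untouched.
	sorted := append([]float64(nil), values...)
	sort.Float64s(sorted)

	sum := 0.0
	for _, v := range sorted {
		sum += v
	}
	n := len(sorted)
	median := sorted[n/2]
	if n%2 == 0 {
		median = (sorted[n/2-1] + sorted[n/2]) / 2
	}
	return summary{
		Count:  n,
		Min:    sorted[0],
		Max:    sorted[n-1],
		Mean:   sum / float64(n),
		Median: median,
	}, true
}

func main() {
	s, ok := summarize([]float64{4, 1, 3, 2})
	if !ok {
		fmt.Println("no data")
		return
	}
	fmt.Printf("count=%d min=%.2f max=%.2f mean=%.2f median=%.2f\n",
		s.Count, s.Min, s.Max, s.Mean, s.Median)
}
```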
For string-based data, the column name is still at the top of the output. Below it are a horizontal histogram and a count of the unique values found in the column.
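To make the string summary concrete, here is a minimal Go sketch that counts unique values and renders a text-based horizontal histogram. The function names (`countValues`, `renderHistogram`), the `#` bar character, and the bar width are assumptions for the example and may differ from what the tool actually draws.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// countValues tallies how often each distinct value appears (hypothetical helper).
func countValues(values []string) map[string]int {
	counts := make(map[string]int)
	for _, v := range values {
		counts[v]++
	}
	return counts
}

// renderHistogram returns one line per distinct value, most frequent first.
// Bars are scaled so the most frequent value is `width` characters long.
func renderHistogram(counts map[string]int, width int) []string {
	if len(counts) == 0 {
		return nil
	}
	keys := make([]string, 0, len(counts))
	labelWidth := 0
	for k := range counts {
		keys = append(keys, k)
		if len(k) > labelWidth {
			labelWidth = len(k)
		}
	}
	// Sort by descending count, breaking ties alphabetically for stable output.
	sort.Slice(keys, func(i, j int) bool {
		if counts[keys[i]] != counts[keys[j]] {
			return counts[keys[i]] > counts[keys[j]]
		}
		return keys[i] < keys[j]
	})
	maxCount := counts[keys[0]]
	lines := make([]string, 0, len(keys))
	for _, k := range keys {
		barLen := counts[k] * width / maxCount
		if barLen < 1 {
			barLen = 1 // always show at least one mark for a present value
		}
		lines = append(lines, fmt.Sprintf("%-*s %s %d",
			labelWidth, k, strings.Repeat("#", barLen), counts[k]))
	}
	return lines
}

func main() {
	column := []string{"red", "blue", "red", "green", "red", "blue"}
	counts := countValues(column)
	for _, line := range renderHistogram(counts, 12) {
		fmt.Println(line)
	}
	fmt.Printf("unique values: %d\n", len(counts))
}
```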
If Go is installed, run the following command:
go get github.com/tmickleydoyle/shallow-explore
Instructions for Installing Go
Installation with Homebrew
brew install go
Why I Built This Tool
I find myself running and rerunning the same basic statistical analysis on data to understand how trends are moving. I figured, why not make it easier and share it with everyone else? I hope this speeds up your decision-making.