A Python library for creating fast, repeatable and self-documenting data analysis pipelines.

proof is a Python library for creating optimized, repeatable and self-documenting data analysis pipelines. proof was designed to be used with the agate data analysis library, but can be used with numpy, pandas or any other method of processing data.
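
To make the idea of a repeatable pipeline concrete, here is a minimal sketch in plain Python. It does not use proof and is not proof's API: the `Step` class, its `then` and `run` methods, and the `refresh` flag are hypothetical names invented for this illustration. The sketch assumes each stage is a function that receives a dict and adds its results to it, and that a stage's result can be cached in memory so a second run skips work that has already been done.

```python
import copy


class Step:
    """One pipeline stage. Hypothetical helper, not proof's actual API."""

    def __init__(self, func):
        self.func = func
        self.children = []
        self._cached = None  # data dict as it looked after this step last ran

    def then(self, func):
        """Attach a step that runs after this one and return the new step."""
        child = Step(func)
        self.children.append(child)
        return child

    def run(self, data=None, refresh=False):
        """Run this step and its descendants, reusing cached results."""
        if self._cached is None or refresh:
            # Copy the incoming data so sibling steps cannot affect each other.
            local = copy.deepcopy(data) if data is not None else {}
            self.func(local)
            self._cached = local
            refresh = True  # anything downstream of a rerun step is stale
        for child in self.children:
            child.run(self._cached, refresh)
        return self._cached


calls = []  # records which stage functions actually executed


def load(data):
    calls.append("load")
    data["rows"] = [3, 1, 2]


def total(data):
    calls.append("total")
    data["total"] = sum(data["rows"])


def largest(data):
    calls.append("largest")
    data["largest"] = max(data["rows"])


root = Step(load)
total_step = root.then(total)
largest_step = root.then(largest)

root.run()
root.run()  # the second call finds every result cached and runs nothing
```

Because each step works on its own copy of the data, the two branches stay independent, and the function names plus the order in which they are chained serve as a readable record of the analysis. This sketch keeps its cache in memory only, so nothing survives between separate runs of a script.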

Related Repos



princefishthrower A tiered chat app based on reddit account age for all wall street bets users.
 

therealsreehari This repository consists of free resources for learning data science from beginning to end. It is divided into four main parts.
 

aaronwangy A helpful 4-page data science cheatsheet to assist with exam reviews, interview prep, and anything in-between.
 

therealsreehari This repository combines resources that lie scattered all over the internet. It arranges these valuable resources sequentially to help beginners who are searching for free, structured learning material for data science. For constant updates, follow me on Twitter.
 

AutoViML Use advanced feature engineering strategies and select the best features from your data set fast with a single line of code.
 

Seagate CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
 

Androz2091 🌀 What's really in your Discord Data package?
 

DerwenAI Graph-Based Data Science: an abstraction layer in Python for building knowledge graphs, integrated with popular graph libraries – atop Pandas, RDFlib, pySHACL, RAPIDS, NetworkX, iGraph, PyVis, pslpython, pyarrow, etc.