A Docker-based, low-latency deep learning inference server using the PyTorch C++ frontend and NVIDIA GPUs.

