Triton Inference Server provides a cloud and edge inferencing solution optimized for both CPUs and GPUs. Triton supports HTTP/REST and gRPC protocols that allow remote clients to request inference for any model managed by the server. For edge deployments, Triton is available as a shared library with a C API that allows its full functionality to be included directly in an application.
Category: C/C++ / Artificial Intelligence
Watchers: 116
Stars: 3.8k
Forks: 905
Last update: Jun 30, 2022
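
To illustrate the HTTP/REST interface mentioned in the description, here is a minimal Python sketch of how a remote client might build an inference request. It assumes the server exposes the KServe v2 style endpoint `/v2/models/<model>/infer` on port 8000 (the usual default). The model name `my_model`, the tensor name `INPUT0`, and the shape, datatype, and values are hypothetical placeholders, not taken from the project. The sketch uses only the standard library, and nothing is sent until `infer` is called.

```python
import json
import urllib.request

# Assumption: a Triton server is reachable here; 8000 is the usual default HTTP port.
BASE_URL = "http://localhost:8000"


def build_infer_request(model_name, input_name, shape, datatype, data, base_url=BASE_URL):
    """Build (but do not send) a POST request for the v2 inference endpoint."""
    body = {
        "inputs": [
            {"name": input_name, "shape": shape, "datatype": datatype, "data": data}
        ]
    }
    return urllib.request.Request(
        url=f"{base_url}/v2/models/{model_name}/infer",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def infer(request, timeout=10.0):
    """Send the request to a running server and decode the JSON response."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Hypothetical model and tensor names, for illustration only.
req = build_infer_request("my_model", "INPUT0", [1, 4], "FP32", [1.0, 2.0, 3.0, 4.0])
```

With a server running and a matching model loaded, `infer(req)` would return the decoded JSON response. The input name, shape, and datatype must match the configuration of the model actually deployed.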

