Hi, I deploy machine learning models for the highest performance on targets ranging from NVIDIA Jetson kits and Intel CPUs to cloud services. I optimize for maximum throughput and minimum latency. I work with your data scientists to make sure the model's intent and objective are retained during conversion and deployment. I usually work in C++, and sometimes in Python.