At Myrtle.ai we specialize in delivering the highest throughput for DNN inference within strict latency bounds. We are focused on two main areas, Recommendation Systems and Recurrent Neural Networks. Recommendation Systems. High throughput, low latency DNNs are an imperative for recommendation engines. We have a scalable solution which doubles the compute density of existing infrastructure. This halves the capital cost and halves the energy required to deliver the same capability, enabling major corporations to meet their carbon footprint goals. Recurrent Neural Networks. We deliver industry-leading performance for applications that contain RNN layers such as speech transcription, machine translation, speech synthesis and natural language processing. If you run inference within strict latency bounds then we can help you achieve more in the time available. The extremely low latency is critical in some applications such as time series analysis and high frequency trading; it also greatly simplifies the scheduling of multiple asynchronous workloads with differing Service Level Agreements.