ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal AI, and Speech AI (Automatic Speech Recognition and Text-to-Speech)