A scalable generative AI framework built for researchers and developers working on large language models, multimodal AI, and speech AI (automatic speech recognition and text-to-speech)
ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator