Olive is a tool that optimizes models to ONNX format, making them suitable for deployment in Foundry Local. It uses techniques like quantization and graph optimization to improve performance.
https://github.com/microsoft/Olive
Olive is a tool that optimizes models to ONNX format, making them suitable for deployment in Foundry Local. It uses techniques like quantization and graph optimization to improve performance.
https://github.com/microsoft/Olive