eIQ ONNX2TFLite
eIQ ONNX2TFLite is a tool that converts neural networks to the TF Lite format. It can also optionally quantize them in the ONNX format, before conversion, using the ONNX2Quant module.
Note
Quantization with ONNX2Quant is intended to be followed by conversion to TF Lite and deployment to the eIQ Neutron NPU. It is not a general-purpose quantization tool, and it should not be used with other NPUs or with different runtimes such as ONNX Runtime.
This guide will show you how to: