2.1.0 Update (12th Jun, 2024)
Enhanced Latency Bottleneck Visualization - Node Breakdown

The Node Breakdown feature has been significantly improved to help you identify latency bottlenecks. The visualization of TensorRT benchmark results on the matching ONNX graph has been enhanced, so high-latency areas are easier to pinpoint from operator colors. The stronger the color, the higher the latency.
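To illustrate the idea of "stronger color, higher latency", here is a minimal sketch that maps per-operator latencies to a color intensity. The function names, the linear min-max normalization, the white-to-red color scale, and the operator names are all assumptions made for illustration. They are not taken from OwLite, which may compute colors differently.

```python
from typing import Dict


def latency_to_intensity(latencies_ms: Dict[str, float]) -> Dict[str, float]:
    """Scale each operator latency to [0, 1] (assumed linear min-max scaling)."""
    if not latencies_ms:
        return {}
    lowest = min(latencies_ms.values())
    highest = max(latencies_ms.values())
    span = highest - lowest
    if span == 0:
        # All operators are equally slow, so none stands out.
        return {name: 0.0 for name in latencies_ms}
    return {name: (ms - lowest) / span for name, ms in latencies_ms.items()}


def intensity_to_hex(intensity: float) -> str:
    """Map intensity 0.0 to white and 1.0 to saturated red (assumed palette)."""
    level = round(255 * (1.0 - intensity))
    return f"#ff{level:02x}{level:02x}"


# Hypothetical operator names and latencies, not real benchmark results.
example = {"Conv_0": 1.0, "Relu_1": 2.0, "MatMul_3": 3.0}
intensities = latency_to_intensity(example)
colors = {name: intensity_to_hex(value) for name, value in intensities.items()}
```

In this sketch the slowest operator gets the most saturated color and the fastest stays white, which is the reading rule described above.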
Addition of New Data Type for Optimization: FP8

A new data type, FP8, has been added for optimization. It can potentially improve accuracy by reducing quantization error. You can now simulate quantization using the fp8_e4m3 data type. FP8 quantization supports STE as the QAT backward method, and MinMax and Percentile as PTQ calibration methods.
- The precision parameter has been replaced with the data type parameter.
- The former precision : 8 is now updated to data type : int8.
- The unsigned toggle has been removed.
- A new data type uint8 (unsigned int8) has been added.
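The parameter change listed above can be sketched as a small conversion helper. The function name and argument names are hypothetical and are not part of the OwLite API. Mapping the old unsigned toggle to uint8 is an assumption inferred from the list above.

```python
def to_data_type(precision: int, unsigned: bool = False) -> str:
    """Convert the former precision/unsigned settings to a data type name.

    Hypothetical helper: only precision 8 is covered because it is the only
    value the release notes mention.
    """
    if precision != 8:
        raise ValueError(f"no documented mapping for precision {precision}")
    # Assumption: the removed unsigned toggle corresponds to the new uint8 type.
    return "uint8" if unsigned else "int8"


old_setting = {"precision": 8, "unsigned": False}
new_setting = {"data type": to_data_type(**old_setting)}
```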
Note: FP8 support is currently highly unstable and may yield unexpected results. FP8 benchmarks are not supported on the free plan.
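For readers unfamiliar with simulated FP8 quantization, the following is a minimal pure-Python sketch of the general technique, not OwLite's implementation. It assumes the common e4m3 layout (4 exponent bits, 3 mantissa bits, largest finite value 448, smallest normal exponent -6) with saturation at the maximum. All function names are hypothetical, and the percentile calibration uses a simple nearest-rank rule chosen for illustration.

```python
import math

E4M3_MAX = 448.0          # largest finite e4m3 value (assumed format variant)
E4M3_MIN_NORMAL_EXP = -6  # exponent of the smallest normal e4m3 number
E4M3_MANTISSA_BITS = 3


def round_to_e4m3(value: float) -> float:
    """Round a float to the nearest value representable in e4m3."""
    clipped = max(-E4M3_MAX, min(E4M3_MAX, value))  # saturate (assumption)
    if clipped == 0.0:
        return 0.0
    _, exp = math.frexp(abs(clipped))  # abs(clipped) = m * 2**exp, 0.5 <= m < 1
    exponent = max(exp - 1, E4M3_MIN_NORMAL_EXP)  # subnormals share one step
    step = math.ldexp(1.0, exponent - E4M3_MANTISSA_BITS)
    return round(clipped / step) * step  # round half to even


def calibrate_amax(samples, method="minmax", percentile=99.9):
    """Pick the clipping range from calibration samples (PTQ-style)."""
    magnitudes = sorted(abs(s) for s in samples)
    if not magnitudes:
        raise ValueError("no calibration samples")
    if method == "minmax":
        return float(magnitudes[-1])
    if method == "percentile":
        rank = max(1, math.ceil(percentile / 100.0 * len(magnitudes)))
        return float(magnitudes[rank - 1])  # nearest-rank percentile
    raise ValueError(f"unknown calibration method: {method}")


def fake_quantize_fp8(values, amax):
    """Scale into the e4m3 range, round, and scale back (simulated FP8)."""
    if amax <= 0:
        raise ValueError("amax must be positive")
    scale = amax / E4M3_MAX
    return [round_to_e4m3(v / scale) * scale for v in values]


quantized = fake_quantize_fp8([0.3, 500.0, -1.9], amax=448.0)
```

In QAT, a straight-through estimator (STE) would treat the rounding step as the identity in the backward pass, so gradients flow through unchanged. That part needs an autograd framework and is not shown here.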
Updated OwLite Support Channels
The OwLite support system has been updated to provide a smoother inquiry and response experience. If you have any inquiries, you can now search the OwLite documentation or the help center.