INT8 conv operator implementation with NCHWc data layout for Intel machines (#1680)
* Int8 implementation for convolution operator on Intel Skylake * Int8 implementation for convolution operator on Intel Skylake * PR changes * PR changes * PR changes * Fixing an error * Fixing an error * Minor typos fix * Minor typos fix * Removing the broadcast16 CPP code. Using astype feature instead * Replacing constant by variable name num_elements_intel * Name fixes and tensorize update rule updated * Fixing the bug about checking skylake * Replacing bitcast with reinterpret * Isolating INT8 and FP32 schedules to ease out future AutoTVM PR merge * Putting check_skylake function in the x86 directory * Added documentation and organizing files to better locations * Tensor intrin renaming. Avoid code duplication for intrin by kernel reshaping
Showing
- topi/python/topi/nn/conv2d.py 27 additions, 1 deletiontopi/python/topi/nn/conv2d.py
- topi/python/topi/x86/check_targets.py 12 additions, 0 deletionstopi/python/topi/x86/check_targets.py
- topi/python/topi/x86/conv2d.py 131 additions, 14 deletionstopi/python/topi/x86/conv2d.py
- topi/python/topi/x86/conv2d_avx_1x1.py 117 additions, 0 deletionstopi/python/topi/x86/conv2d_avx_1x1.py
- topi/python/topi/x86/conv2d_avx_common.py 123 additions, 0 deletionstopi/python/topi/x86/conv2d_avx_common.py
- topi/python/topi/x86/tensor_intrin.py 84 additions, 0 deletionstopi/python/topi/x86/tensor_intrin.py
- topi/recipe/conv/test_conv_int8_intel.py 149 additions, 0 deletionstopi/recipe/conv/test_conv_int8_intel.py
topi/python/topi/x86/check_targets.py
0 → 100644
topi/python/topi/x86/tensor_intrin.py
0 → 100644
topi/recipe/conv/test_conv_int8_intel.py
0 → 100644
Please register or sign in to comment