[TOPI] Basic x86 schedules (#775)
* add basic x86 schedules
* parallelize & vectorize batchnorm + relu
* fuse conv into bn + relu
* move rc loop to outer
* add nhwc conv
* change weight layout to hwcf
* conv + bn + relu fusion for nhwc conv
* fix conv_nhwc schedule when no fusion
* clean up default parallel schedules
* simplify elemwise parallel
* fix elemwise parallel for batch == 1
* update nhwc conv test
* fix and add comment
* fix lint
* remove redundant import
* remove default multithreading for some ops
* remove default multithreading for global pool
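
For readers unfamiliar with the layouts named in the commit message ("add nhwc conv", "change weight layout to hwcf"), the following is a minimal plain-Python sketch of a direct convolution over NHWC data with HWCF weights. It is not the code from this commit: the function name `conv2d_nhwc_ref`, its parameters, and the symmetric zero-padding convention are assumptions made for illustration, and it has no TVM dependency.

```python
def conv2d_nhwc_ref(data, kernel, stride=1, padding=0):
    """Direct convolution (hypothetical reference, not the commit's code).

    data   is indexed [batch][height][width][in_channel]              (NHWC)
    kernel is indexed [kernel_h][kernel_w][in_channel][out_channel]   (HWCF)
    """
    n, h, w, c = len(data), len(data[0]), len(data[0][0]), len(data[0][0][0])
    kh, kw, f = len(kernel), len(kernel[0]), len(kernel[0][0][0])
    assert len(kernel[0][0]) == c, "data and kernel input channels must match"

    # Assumption: the same zero padding is applied on all four sides.
    out_h = (h + 2 * padding - kh) // stride + 1
    out_w = (w + 2 * padding - kw) // stride + 1
    out = [[[[0.0] * f for _ in range(out_w)] for _ in range(out_h)]
           for _ in range(n)]

    for b in range(n):
        for oy in range(out_h):
            for ox in range(out_w):
                for ky in range(kh):
                    for kx in range(kw):
                        iy = oy * stride + ky - padding
                        ix = ox * stride + kx - padding
                        if not (0 <= iy < h and 0 <= ix < w):
                            continue  # padded position contributes zero
                        pixel = data[b][iy][ix]   # all input channels at (iy, ix)
                        taps = kernel[ky][kx]     # [in_channel][out_channel]
                        for rc in range(c):       # reduction over input channels
                            for oc in range(f):   # output channels are innermost
                                out[b][oy][ox][oc] += pixel[rc] * taps[rc][oc]
    return out


# 1x1 kernel with two output channels applied to a 2x2 single-channel image.
data = [[[[1.0], [2.0]], [[3.0], [4.0]]]]   # N=1, H=2, W=2, C=1
kernel = [[[[1.0, 10.0]]]]                  # KH=1, KW=1, C=1, F=2
result = conv2d_nhwc_ref(data, kernel)
```

In both layouts the channel axes come last, so the innermost loop walks adjacent output channels of one output pixel. That is the kind of loop a CPU schedule would typically try to vectorize, although how the commit's schedules actually split and order the loops is not shown here.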
Showing 9 changed files:
- topi/python/topi/generic/nn.py: 18 additions, 0 deletions
- topi/python/topi/nn/conv2d.py: 51 additions, 0 deletions
- topi/python/topi/testing/__init__.py: 1 addition, 0 deletions
- topi/python/topi/testing/conv2d_nhwc_python.py: 67 additions, 0 deletions
- topi/python/topi/x86/__init__.py: 3 additions, 1 deletion
- topi/python/topi/x86/conv2d.py: 66 additions, 2 deletions
- topi/python/topi/x86/injective.py: 35 additions, 0 deletions
- topi/python/topi/x86/nn.py: 56 additions, 0 deletions
- topi/tests/python/test_topi_conv2d_nhwc.py: 59 additions, 0 deletions
topi/python/topi/x86/injective.py (new file, mode 100644)
topi/python/topi/x86/nn.py (new file, mode 100644)
topi/tests/python/test_topi_conv2d_nhwc.py (new file, mode 100644)