  1. Mar 08, 2018
  2. Mar 02, 2018
  3. Mar 01, 2018
  4. Feb 28, 2018
  5. Feb 23, 2018
  6. Feb 19, 2018
  7. Feb 18, 2018
  8. Feb 15, 2018
  9. Feb 13, 2018
  10. Feb 12, 2018
  11. Feb 10, 2018
    • [TOPI] Initial NHWC layout support (#882) · 54d4fe4b
      masahi authored
      * add 4 dim softmax
      
      * update for NHWC layout
      
      * remove layout param from softmax
      
      * fix typo
      
      * minor fix to pool
      
      support axis=1 ndims=5 softmax.
      
      add softmax axis
      
      * few fix for softmax
      
      * fix typo
      
      * add more doc
      
      * minor doc fix
      
      * fix upsampling output shape
      
      * fix lint
      
      * cleanup softmax
      
      * minor fix
      
      * raise exception instead of assert, handles negative axis
      
      * check axis after axis transformation
  12. Feb 09, 2018
  13. Feb 08, 2018
  14. Feb 07, 2018
  15. Feb 05, 2018
  16. Feb 04, 2018
  17. Feb 03, 2018
  18. Feb 02, 2018
  19. Feb 01, 2018
    • Update HalideIR submodule to include TVM_STATIC_IR_FUNCTOR_REGISTER (#857) · 108e9f3f
      alex-weaver authored
      * Update HalideIR commit to include TVM_STATIC_IR_FUNCTOR_REGISTER
      
      * Fix HalideIR to point to the right commit
      
      * Add missing using to C++ TOPI nn.h
      
      * Update HalideIR to include compiler error fix
      
      * Fixed error where broadcast_to fails if shape is tuple of IntImm
      
      * Change get_const_int to support int as input
  20. Jan 31, 2018
  21. Jan 28, 2018
    • Tianqi Chen's avatar
      9a1c7fcc
    • Porting schedules (except convolutions) to C++ (#763) · f280f23a
      alex-weaver authored
      * Ported injective schedules to C++. Added some elementwise ops.
      
      * Fix lint errors
      
      * Added reduction ops and schedules
      
      * Fix lint errors
      
      * Fix lint errors
      
      * Fix lint errors
      
      * Added transform ops
      
      * Fix lint errors
      
      * Fix lint errors
      
      * Added softmax, log_softmax, leaky_relu and flatten ops.
      Fixed issue where TVM_DECLARE_INTRIN_UNARY used the PureExtern flag
      instead of PureIntrinsic.
      Added softmax CUDA schedule.
      
      * Fix lint
      
      * Fix lint
      
      * Added binary_dense, batch_norm_inference, dense, dilate, scale_shift_*,
      global_pool and pool ops.
      Extended pad to allow specifying pad_value.
      Fixed issue where pad would throw if padding was zero in all dimensions.
      
      * Fix lint
      
      * Fix lint
      
      * Added CUDA schedules for dense, pool and global_pool
      
      * Added extern schedules for generic and CUDA
      
      * Fix lint
      
      * Added x86 binary schedules
      
      * Fix lint
      
      * Added rocm dense schedule. Added rocBLAS and cuBLAS support to dense ops
      
      * Added pow ops. Added x86 default and injective schedules
      
      * Fix lint
      
      * Fix lint
      
      * Fix lint
      
      * Fix lint
      
      * Fix lint
      
      * Fix indent
      
      * Removed schedules directory
      
      * Changed left_shift, right_shift to operators. Changed pad_value in pad() to remove pointer usage
      
      * Fixed usage of pad in nn/pooling.h. Fixed declaration of operator>>
      
      * Fixed comments for shift operators
      
      * Added comments to utility functions
      
      * Added TOPI C++ library, exporting broadcast_add op
      
      * Fix lint
      
      * Share libinfo.py with TVM
      
      * Fix lint
      
      * Add other broadcast ops
      
      * Fix lint
      
      * Fix imports in topi
      
      * Fix lib names
      
      * Fixed build issue where windows builds don't apply correct definitions
      
      * Removed TVM_EXPORTS from topi library
      
      * Attempted CI build fix
      
      * Add topi lib to tvm_multilib
      
      * Fix Jenkinsfile
      
      * Added TOPI build target to Makefile
      
      * Fix nn op namespaces.
      
      * Fix lint
      
      * Renamed TOPI lib to libtvm_topi
      
      * Removed _ffi/base.py
      
      * Remove _ffi from topi, now shared with tvm.
      
      * Make libtvm_topi loading optional
      
      * Fix compiler warnings
      
      * Fix lint
      
      * Fix lint
      
      * Fix lint
      
      * Fix build error by making new libs argument to Target optional
      
      * Added C++ Target type interop. Added registration of remaining C++ ops and schedules. Added test of broadcast ops
      
      * Fix lint
      
      * Fix lint
      
      * Fix compile error
      
      * Fix compiler warnings
      
      * Fix compiler warnings
      
      * Fixed int vector interop. Fixed argmin incorrectly invoking argmax. Fixed corner case in default schedules of attempting to fuse 0 length axes. Added tests for reduce ops.
      
      * Refactored reduce builders
      
      * Fixed typos in topi.cc. Added basic test.
      
      * Fixed padding size error. Added dense, dilate, pooling tests
      
      * Fixed issue where clip would output a different dtype to the input. Added split_sections op to cover the other mode of the python split op. Added tests.
      
      * Changed extension type numbers to avoid clash with NNVM
      
      * Fix lint
      
      * Fix compiler warnings
      
      * Removed use of std::vector from the public TOPI API
      
      * Fix lint
      
      * Add TOPI C++ tests to CI
      
      * Fixed detail namespacing. Improved comments.
  22. Jan 19, 2018
  23. Jan 16, 2018
    • [TOPI] Basic x86 schedules (#775) · 3df42cd7
      masahi authored
      * add basic x86 schedules
      
      * parallelize & vectorize batchnorm + relu
      
      * fuse conv into bn + relu
      
      * move rc loop to outer
      
      * add nhwc conv
      
      * change weight layout to hwcf
      
      * conv + bn + relu fusion for nhwc conv
      
      * fix conv_nhwc schedule when no fusion
      
      * clean up default parallel schedules
      
      * simplify elemwise parallel
      
      * fix elemwise parallel for batch == 1
      
      * update nhwc conv test
      
      * fix and add comment
      
      * fix lint
      
      * remove redundant import
      
      * remove default multithreading for some ops
      
      * remove default multithreading for global pool
    • fix mali topi for python3 (#789) · 7ca44d7a
      Lianmin Zheng authored
    • fix (#788) · b9a6c091
      Xingjian Shi authored
    • [TOPI] add schedule for ARM Mali GPU (#786) · 16694815
      Lianmin Zheng authored
      * add schedule for ARM Mali GPU
      
      * fix lint
      
      * fix lint
    • [CODEGEN] fix vector conversion for opencl (#783) · 8d263e37
      Lianmin Zheng authored
      * support more argument type in depthwise_conv2d
      
      * mark all pointer as 'restrict' & fix vector conversion for opencl
  24. Jan 15, 2018
  25. Jan 12, 2018
  26. Jan 11, 2018
    • [TOPI] Upsampling op support (#772) · be457348
      masahi authored
      * add upsampling cpu op
      
      * add upsampling gpu schedule
      
      * add doc for upsampling op
      
      add more doc
      
      * cleanup upsampling test
      
      * add doc
      
      * fix lint
      
      * fix lint
      
      * fix lint
      
      * remove unused import
      
      * remove skimage dependency
      
      * remove skimage import
      
      * remove schedule_upsampling
  27. Jan 04, 2018
  28. Jan 03, 2018
  29. Jan 02, 2018
    • [CONTRIB] cuBLAS integration (#744) · 3d5032ae
      masahi authored
      * add cublas support
      
      * integrate cublas to topi dense
      
      * add cublas error check
      
      * minor fix
      
      * fix lint
      
      * remove topi import from contrib unittest