Skip to content
Snippets Groups Projects
  1. Jan 20, 2018
  2. Jan 19, 2018
  3. Jan 16, 2018
    • masahi's avatar
      [TOPI] Basic x86 schedules (#775) · 3df42cd7
      masahi authored
      * add basic x86 schedules
      
      * parallelize & vectorize batchnorm + relu
      
      * fuse conv into bn + relu
      
      * move rc loop to outer
      
      * add nhwc conv
      
      * change weight layout to hwcf
      
      * conv + bn + relu fusion for nhwc conv
      
      * fix conv_nhwc schedule when no fusion
      
      * clean up default parallel schedules
      
      * simplify elemwise parallel
      
      * fix elemwise parallel for batch == 1
      
      * update nhwc conv test
      
      * fix and add comment
      
      * fix lint
      
      * remove redundant import
      
      * remove default multithreading for some ops
      
      * remove default multithreading for global pool
      3df42cd7
    • Lianmin Zheng's avatar
      fix mali topi for python3 (#789) · 7ca44d7a
      Lianmin Zheng authored
      7ca44d7a
    • Xingjian Shi's avatar
      fix (#788) · b9a6c091
      Xingjian Shi authored
      b9a6c091
    • Lianmin Zheng's avatar
      [TOPI] add schedule for ARM Mali GPU (#786) · 16694815
      Lianmin Zheng authored
      * add schedule for ARM Mali GPU
      
      * fix lint
      
      * fix lint
      16694815
    • Lianmin Zheng's avatar
      [CODEGEN] fix vector conversion for opencl (#783) · 8d263e37
      Lianmin Zheng authored
      * support more argument type in depthwise_conv2d
      
      * mark all pointer as 'restrict' & fix vector conversion for opencl
      8d263e37
  4. Jan 15, 2018
  5. Jan 12, 2018
  6. Jan 11, 2018
  7. Jan 10, 2018
  8. Jan 09, 2018
  9. Jan 08, 2018
  10. Jan 07, 2018
    • xqdan's avatar
      [SCHEDULE]Improve bound deduce for loop partition (#743) (#755) · 9d6dbe34
      xqdan authored
      * [SCHEDULE]enable partition const loop with build flag (#719)
      
          * enable partition loop with build flag
      
          * add a testcase, and modify LoopPartition related cases
      
      *     * add document for split_const_loop
      
      * [IRbuild]Support automatically Name Loop Variable in IRBuilder (#719)
      
          * add idx_num in class
      
      * using typical index [i, j, k] first, then i_suffix
      
      * keep inputs names
      
      * fix lint
      
      * improve comment of name
      
      * fix lint
      
      * [SCHEDULE]Improve bound deduce for loop partition (#743)
      
          * add divided checking when deducing
      
          * related testcase
      
      * fix
      
      * * transform LE and GE first
      * remove is_equal
      * modify testcase for edge cases checking
      
      * * fix comment
      
      * * fix lint
      
      * * apply transformation form LT -> LE, GT -> GE
      
      * * fix lint
      
      * simplify code and testcase
      
      * add negative co-efficient case
      
      * More complicated cases
      
      * add testcase
      
      * simplify testcase
      
      * comment case for now
      
      * fix testcase
      9d6dbe34
  11. Jan 04, 2018
  12. Jan 03, 2018
  13. Jan 02, 2018
    • masahi's avatar
      [CONTRIB] cuBLAS integration (#744) · 3d5032ae
      masahi authored
      * add cublas support
      
      * integrate cublas to topi dense
      
      * add cublas error check
      
      * minor fix
      
      * fix lint
      
      * remove topi import from contrib unittest
      3d5032ae
  14. Dec 31, 2017
  15. Dec 29, 2017
    • xqdan's avatar
      enable partition const loop with build flag (#732) · 5f1816db
      xqdan authored
      * [SCHEDULE]enable partition const loop with build flag (#719)
      
          * enable partition loop with build flag
      
          * add a testcase, and modify LoopPartition related cases
      
      *     * add document for split_const_loop
      5f1816db
    • masahi's avatar
      Let CUDNN choose the best algo (#734) · 66fa0c3d
      masahi authored
      * use cudnn findalgo to choose the best algo
      
      * fix lint
      66fa0c3d
    • kun-zh's avatar
      Re-organize the test cases for tensorize. (#736) · f0cdb50e
      kun-zh authored
      * when there is no intrin func, using body for initialization. For issue 714.
      
      * Refine code per review comments, and add a test case.
      
      * Fix lint issues.
      
      * Re-organize the tensorize test cases, and add a new case for none-reset
      mode.
      
      * Fix a typo.
      
      * Delete the unit case because merged it into test_schedule_tensorize.py already.
      f0cdb50e
  16. Dec 27, 2017
  17. Dec 26, 2017
  18. Dec 25, 2017
  19. Dec 24, 2017
Loading