Skip to content
Snippets Groups Projects
  1. Jan 27, 2018
  2. Jan 26, 2018
  3. Jan 25, 2018
  4. Jan 24, 2018
  5. Jan 23, 2018
  6. Jan 22, 2018
  7. Jan 21, 2018
  8. Jan 20, 2018
  9. Jan 19, 2018
  10. Jan 16, 2018
    • masahi's avatar
      [TOPI] Basic x86 schedules (#775) · 3df42cd7
      masahi authored
      * add basic x86 schedules
      
      * parallelize & vectorize batchnorm + relu
      
      * fuse conv into bn + relu
      
      * move rc loop to outer
      
      * add nhwc conv
      
      * change weight layout to hwcf
      
      * conv + bn + relu fusion for nhwc conv
      
      * fix conv_nhwc schedule when no fusion
      
      * clean up default parallel schedules
      
      * simplify elemwise parallel
      
      * fix elemwise parallel for batch == 1
      
      * update nhwc conv test
      
      * fix and add comment
      
      * fix lint
      
      * remove redundant import
      
      * remove default multithreading for some ops
      
      * remove default multithreading for global pool
      3df42cd7
    • Lianmin Zheng's avatar
      fix mali topi for python3 (#789) · 7ca44d7a
      Lianmin Zheng authored
      7ca44d7a
    • Xingjian Shi's avatar
      fix (#788) · b9a6c091
      Xingjian Shi authored
      b9a6c091
    • Lianmin Zheng's avatar
      [TOPI] add schedule for ARM Mali GPU (#786) · 16694815
      Lianmin Zheng authored
      * add schedule for ARM Mali GPU
      
      * fix lint
      
      * fix lint
      16694815
    • Lianmin Zheng's avatar
      [CODEGEN] fix vector conversion for opencl (#783) · 8d263e37
      Lianmin Zheng authored
      * support more argument type in depthwise_conv2d
      
      * mark all pointer as 'restrict' & fix vector conversion for opencl
      8d263e37
  11. Jan 15, 2018
  12. Jan 12, 2018
  13. Jan 11, 2018
  14. Jan 10, 2018
  15. Jan 09, 2018
  16. Jan 08, 2018
  17. Jan 07, 2018
    • xqdan's avatar
      [SCHEDULE]Improve bound deduce for loop partition (#743) (#755) · 9d6dbe34
      xqdan authored
      * [SCHEDULE]enable partition const loop with build flag (#719)
      
          * enable partition loop with build flag
      
          * add a testcase, and modify LoopPartition related cases
      
      *     * add document for split_const_loop
      
      * [IRbuild]Support automatically Name Loop Variable in IRBuilder (#719)
      
          * add idx_num in class
      
      * using typical index [i, j, k] first, then i_suffix
      
      * keep inputs names
      
      * fix lint
      
      * improve comment of name
      
      * fix lint
      
      * [SCHEDULE]Improve bound deduce for loop partition (#743)
      
          * add divided checking when deducing
      
          * related testcase
      
      * fix
      
      * * transform LE and GE first
      * remove is_equal
      * modify testcase for edge cases checking
      
      * * fix comment
      
      * * fix lint
      
      * * apply transformation form LT -> LE, GT -> GE
      
      * * fix lint
      
      * simplify code and testcase
      
      * add negative co-efficient case
      
      * More complicated cases
      
      * add testcase
      
      * simplify testcase
      
      * comment case for now
      
      * fix testcase
      9d6dbe34
Loading