Skip to content
Snippets Groups Projects
  1. Jan 24, 2018
    • Tianqi Chen's avatar
      37734045
    • libing4752's avatar
      [PASS] enhance storage_rewrite to support different dtypes for unified buffer (#805) · 5fc4bc57
      libing4752 authored
      * modified schedule_dataflow_rewrite.cc to fix losing tensor problem
      
      * modified schedule_dataflow_rewrite.cc for lint scan
      
      * modified schedule_dataflow_rewrite.cc for lint scan
      
      * using tensor's value_index to index output of stage op
      
      * repare address offset for different kinds of dtype
      
      * bc
      
      * aaa
      
      * aaaaa
      
      * repare address for different dtypes
      
      * remove nonsense files
      
      * add whitespace of line 581
      
      * use base alloc elem_type
      
      * enhance the testcast of basic buffer is 64bits,32bits,16bits,8bits
      
      * use extends[0]->type() as dtype of offset
      
      * clear program writes
      5fc4bc57
  2. Jan 23, 2018
  3. Jan 22, 2018
  4. Jan 21, 2018
  5. Jan 20, 2018
  6. Jan 19, 2018
  7. Jan 16, 2018
    • masahi's avatar
      [TOPI] Basic x86 schedules (#775) · 3df42cd7
      masahi authored
      * add basic x86 schedules
      
      * parallelize & vectorize batchnorm + relu
      
      * fuse conv into bn + relu
      
      * move rc loop to outer
      
      * add nhwc conv
      
      * change weight layout to hwcf
      
      * conv + bn + relu fusion for nhwc conv
      
      * fix conv_nhwc schedule when no fusion
      
      * clean up default parallel schedules
      
      * simplify elemwise parallel
      
      * fix elemwise parallel for batch == 1
      
      * update nhwc conv test
      
      * fix and add comment
      
      * fix lint
      
      * remove redundant import
      
      * remove default multithreading for some ops
      
      * remove default multithreading for global pool
      3df42cd7
    • Lianmin Zheng's avatar
      fix mali topi for python3 (#789) · 7ca44d7a
      Lianmin Zheng authored
      7ca44d7a
    • Xingjian Shi's avatar
      fix (#788) · b9a6c091
      Xingjian Shi authored
      b9a6c091
    • Lianmin Zheng's avatar
      [TOPI] add schedule for ARM Mali GPU (#786) · 16694815
      Lianmin Zheng authored
      * add schedule for ARM Mali GPU
      
      * fix lint
      
      * fix lint
      16694815
    • Lianmin Zheng's avatar
      [CODEGEN] fix vector conversion for opencl (#783) · 8d263e37
      Lianmin Zheng authored
      * support more argument type in depthwise_conv2d
      
      * mark all pointer as 'restrict' & fix vector conversion for opencl
      8d263e37
  8. Jan 15, 2018
  9. Jan 12, 2018
  10. Jan 11, 2018
  11. Jan 10, 2018
  12. Jan 09, 2018
  13. Jan 08, 2018
  14. Jan 07, 2018
    • xqdan's avatar
      [SCHEDULE]Improve bound deduce for loop partition (#743) (#755) · 9d6dbe34
      xqdan authored
      * [SCHEDULE]enable partition const loop with build flag (#719)
      
          * enable partition loop with build flag
      
          * add a testcase, and modify LoopPartition related cases
      
      *     * add document for split_const_loop
      
      * [IRbuild]Support automatically Name Loop Variable in IRBuilder (#719)
      
          * add idx_num in class
      
      * using typical index [i, j, k] first, then i_suffix
      
      * keep inputs names
      
      * fix lint
      
      * improve comment of name
      
      * fix lint
      
      * [SCHEDULE]Improve bound deduce for loop partition (#743)
      
          * add divided checking when deducing
      
          * related testcase
      
      * fix
      
      * * transform LE and GE first
      * remove is_equal
      * modify testcase for edge cases checking
      
      * * fix comment
      
      * * fix lint
      
      * * apply transformation form LT -> LE, GT -> GE
      
      * * fix lint
      
      * simplify code and testcase
      
      * add negative co-efficient case
      
      * More complicated cases
      
      * add testcase
      
      * simplify testcase
      
      * comment case for now
      
      * fix testcase
      9d6dbe34
  15. Jan 04, 2018
  16. Jan 03, 2018
  17. Jan 02, 2018
    • masahi's avatar
      [CONTRIB] cuBLAS integration (#744) · 3d5032ae
      masahi authored
      * add cublas support
      
      * integrate cublas to topi dense
      
      * add cublas error check
      
      * minor fix
      
      * fix lint
      
      * remove topi import from contrib unittest
      3d5032ae
  18. Dec 31, 2017
    • xqdan's avatar
      Support automatically Name Loop Variable in IRBuilder (#716) (#741) · 77299df3
      xqdan authored
      * [SCHEDULE]enable partition const loop with build flag (#719)
      
          * enable partition loop with build flag
      
          * add a testcase, and modify LoopPartition related cases
      
      *     * add document for split_const_loop
      
      * [IRbuild]Support automatically Name Loop Variable in IRBuilder (#719)
      
          * add idx_num in class
      
      * using typical index [i, j, k] first, then i_suffix
      
      * keep inputs names
      
      * fix lint
      
      * improve comment of name
      
      * fix lint
      77299df3
Loading