- Jan 15, 2018
-
-
Aman authored
-
- Jan 12, 2018
-
-
Tianqi Chen authored
* [LLVM] Enable same target option in JITModule * not set mcpu explicitly
-
- Jan 11, 2018
-
-
masahi authored
* add upsampling cpu op * add upsampling gpu schedule * add doc for upsampling op add more doc * cleanup upsampling test * add doc * fix lint * fix lint * fix lint * remove unused import * remove skimage dependency * remove skimage import * remove schedule_upsampling
-
Yuwei Hu authored
-
- Jan 10, 2018
-
-
Tianqi Chen authored
-
- Jan 09, 2018
-
-
Yida Wang authored
* small fixs on docs * add IR output after parallelization
-
Tianqi Chen authored
* [PASS] Improve loop partition to remove un-necessary warning. * fix comment
-
- Jan 08, 2018
-
-
yuruofeifei authored
* Improve opt_gemm tutorial * Addressed comments
-
Tianqi Chen authored
* [PASS] StorageRewrite Fold Inplace op storage when possible * update comment to fix typos
-
- Jan 07, 2018
-
-
xqdan authored
* [SCHEDULE]enable partition const loop with build flag (#719) * enable partition loop with build flag * add a testcase, and modify LoopPartition related cases * * add document for split_const_loop * [IRbuild]Support automatically Name Loop Variable in IRBuilder (#719) * add idx_num in class * using typical index [i, j, k] first, then i_suffix * keep inputs names * fix lint * improve comment of name * fix lint * [SCHEDULE]Improve bound deduce for loop partition (#743) * add divided checking when deducing * related testcase * fix * * transform LE and GE first * remove is_equal * modify testcase for edge cases checking * * fix comment * * fix lint * * apply transformation form LT -> LE, GT -> GE * * fix lint * simplify code and testcase * add negative co-efficient case * More complicated cases * add testcase * simplify testcase * comment case for now * fix testcase
-
- Jan 04, 2018
-
-
Tianqi Chen authored
* [CODEGEN] use charp for voidp * fx
-
Yizhi Liu authored
-
- Jan 03, 2018
-
-
masahi authored
* rocblas integration * fix include * fix lint
-
libing4752 authored
* modified schedule_dataflow_rewrite.cc to fix losing tensor problem * modified schedule_dataflow_rewrite.cc for lint scan * modified schedule_dataflow_rewrite.cc for lint scan * using tensor's value_index to index output of stage op
-
Lianmin Zheng authored
* [CODEGEN] update codegen for vector operation * update comment, fix for metal * fix some bugs in codegen * use 'restrict' in every argument * fix * fix
-
- Jan 02, 2018
-
-
masahi authored
* add cublas support * integrate cublas to topi dense * add cublas error check * minor fix * fix lint * remove topi import from contrib unittest
-
- Dec 31, 2017
-
-
xqdan authored
* [SCHEDULE]enable partition const loop with build flag (#719) * enable partition loop with build flag * add a testcase, and modify LoopPartition related cases * * add document for split_const_loop * [IRbuild]Support automatically Name Loop Variable in IRBuilder (#719) * add idx_num in class * using typical index [i, j, k] first, then i_suffix * keep inputs names * fix lint * improve comment of name * fix lint
-
Tianqi Chen authored
-
- Dec 29, 2017
-
-
xqdan authored
* [SCHEDULE]enable partition const loop with build flag (#719) * enable partition loop with build flag * add a testcase, and modify LoopPartition related cases * * add document for split_const_loop
-
masahi authored
* use cudnn findalgo to choose the best algo * fix lint
-
kun-zh authored
* when there is no intrin func, using body for initialization. For issue 714. * Refine code per review comments, and add a test case. * Fix lint issues. * Re-organize the tensorize test cases, and add a new case for none-reset mode. * Fix a typo. * Delete the unit case because merged it into test_schedule_tensorize.py already.
-
- Dec 27, 2017
-
-
kun-zh authored
* when there is no intrin func, using body for initialization. For issue 714. * Refine code per review comments, and add a test case. * Fix lint issues.
-
Xingjian Shi authored
* support dim-0 tensor in topi ops revert transform * revert
-
masahi authored
* add target.libs to target str representation * integrate cudnn into topi cuda * append target.libs to target.options
-
- Dec 26, 2017
-
-
masahi authored
* add extern schedule for miopen * fix comment * optionally dispatch to miopen from topi * fix lint * check if current target is None * use generic dispatch for rocm conv2d * fix lint * fix workspace bug * remove blank line * remove blank line * remove blank line
-
Tianqi Chen authored
-
- Dec 25, 2017
-
-
Yuwei Hu authored
* add x86_64 target * add binary dense operator * rebase * improve schedule * remove x86 target * improve schedule
-
- Dec 24, 2017
-
-
masahi authored
* fist working miopen support * do FindFwdAlgo during build time * fix lint * update doc string * import topi after checking if rocm is enabled * add miopen namespace * fixed descriptor overwrite bug * add use_miopen option * fix lint * better miopen option handling * fix typo * fix options handling
-
Lianmin Zheng authored
* [CODEGEN] update codegen for vector operation * update comment, fix for metal
-
Tianqi Chen authored
-
- Dec 23, 2017
-
-
Cody Hao Yu authored
* Make duplicated function name checker working * Fix dependency checking problem for reducer condition (#712); add test * Fix dependency checking problem for reducer condition (#712); add test * Specify R to be computed inlined
-
Tianqi Chen authored
-
Salem Derisavi authored
-
- Dec 22, 2017
-
-
Salem Derisavi authored
During tensorize, call Simplify on algorithm and intrinsic definitions before CanonicalSimplify. This will prevent a number of false tensorize mismatches. (#718) thanks, this we can use this solution for now
-
- Dec 19, 2017
-
-
Salem Derisavi authored
* 1) removed non-determinism from CanonicalSimplify 2) added couple of testcases for CanonicalSimplify * Use IRDeepCompare instead of comparison of string representation * Give a warning (instead of fatal error) when two "ComExprEntry"s are equal
-
- Dec 17, 2017
-
-
Andrew Adams authored
-
- Dec 16, 2017
-
-
masahi authored
-
- Dec 15, 2017
-
-
Cody Hao Yu authored
-
- Dec 13, 2017
-
-
Salem Derisavi authored
* Simplify expressions early on * fixed lint errors
-
Salem Derisavi authored
* 1) Refactored some parts of the unrolling code into their own methods so we can reuse unrolling functionality in other parts of the code. E.g., to explicitly unroll loops with count of 1 when they are programmatically created. 2) Reorder based on top operator before resorting to pointers, which causes non-determinism. * Fixed lint errors
-