- Feb 04, 2018
-
-
libing4752 authored
* modified schedule_dataflow_rewrite.cc to fix losing tensor problem * modified schedule_dataflow_rewrite.cc for lint scan * modified schedule_dataflow_rewrite.cc for lint scan * using tensor's value_index to index output of stage op * repair address offset for different kinds of dtype * bc * aaa * aaaaa * repair address for different dtypes * remove nonsense files * add whitespace of line 581 * use base alloc elem_type * enhance the test case for basic buffers of 64 bits, 32 bits, 16 bits, and 8 bits * use extends[0]->type() as dtype of offset * clear program writes * enhance inject_copy_intrin to support pragma stmt with no loops * fix cpplint errors * fix cpplint error of ! * enhance detectLinearEquation to support the case with no loop vars * fix cpplint errors
-
- Feb 03, 2018
-
-
Tianqi Chen authored
* Fix Vulkan build, add tanh to LLVM intrinsics, fix HalideIR * fix llvm tanh
-
PENGUINLIONG authored
-
- Feb 02, 2018
-
-
PENGUINLIONG authored
-
Tianqi Chen authored
* [BACKEND] Vulkan Runtime and SPIRV Codegen * fix doc
-
- Jan 31, 2018
-
-
Thierry Moreau authored
-
xqdan authored
* fix #802, create cache based on sugar tensor * [Pass] Improve storage rewrite * fix ci * fix comment * fix comment
-
- Jan 30, 2018
-
-
ZhiWei Zhang authored
* Update workspace_pool.cc * Update workspace_pool.cc
-
- Jan 29, 2018
-
-
Clouds authored
* fix opengl to OpenGL/gl3.h for APPLE * use glfw3 to include gl.h header
-
- Jan 28, 2018
-
-
alex-weaver authored
* Ported injective schedules to C++. Added some elementwise ops. * Fix lint errors * Added reduction ops and schedules * Fix lint errors * Fix lint errors * Fix lint errors * Added transform ops * Fix lint errors * Fix lint errors * Added softmax, log_softmax, leaky_relu and flatten ops. Fixed issue where TVM_DECLARE_INTRIN_UNARY used the PureExtern flag instead of PureIntrinsic. Added softmax CUDA schedule. * Fix lint * Fix lint * Added binary_dense, batch_norm_inference, dense, dilate, scale_shift_*, global_pool and pool ops. Extended pad to allow specifying pad_value. Fixed issue where pad would throw if padding was zero in all dimensions. * Fix lint * Fix lint * Added CUDA schedules for dense, pool and global_pool * Added extern schedules for generic and CUDA * Fix lint * Added x86 binary schedules * Fix lint * Added rocm dense schedule. Added rocBLAS and cuBLAS support to dense ops * Added pow ops. Added x86 default and injective schedules * Fix lint * Fix lint * Fix lint * Fix lint * Fix lint * Fix indent * Removed schedules directory * Changed left_shift, right_shift to operators. Changed pad_value in pad() to remove pointer usage * Fixed usage of pad in nn/pooling.h. Fixed declaration of operator>> * Fixed comments for shift operators * Added comments to utility functions * Added TOPI C++ library, exporting broadcast_add op * Fix lint * Share libinfo.py with TVM * Fix lint * Add other broadcast ops * Fix lint * Fix imports in topi * Fix lib names * Fixed build issue where windows builds don't apply correct definitions * Removed TVM_EXPORTS from topi library * Attempted CI build fix * Add topi lib to tvm_multilib * Fix Jenkinsfile * Added TOPI build target to Makefile * Fix nn op namespaces. * Fix lint * Renamed TOPI lib to libtvm_topi * Removed _ffi/base.py * Remove _ffi from topi, now shared with tvm. 
* Make libtvm_topi loading optional * Fix compiler warnings * Fix lint * Fix lint * Fix lint * Fix build error by making new libs argument to Target optional * Added C++ Target type interop. Added registration of remaining C++ ops and schedules. Added test of broadcast ops * Fix lint * Fix lint * Fix compile error * Fix compiler warnings * Fix compiler warnings * Fixed int vector interop. Fixed argmin incorrectly invoking argmax. Fixed corner case in default schedules of attempting to fuse 0 length axes. Added tests for reduce ops. * Refactored reduce builders * Fixed typos in topi.cc. Added basic test. * Fixed padding size error. Added dense, dilate, pooling tests * Fixed issue where clip would output a different dtype to the input. Added split_sections op to cover the other mode of the python split op. Added tests. * Changed extension type numbers to avoid clash with NNVM * Fix lint * Fix compiler warnings * Removed use of std::vector from the public TOPI API * Fix lint * Add TOPI C++ tests to CI * Fixed detail namespacing. Improved comments.
-
Zhixun Tan authored
-
Siva authored
-
- Jan 27, 2018
-
-
Tianqi Chen authored
-
kun-zh authored
* when there is no intrin func, use the body for initialization. For issue 714. * Refine code per review comments, and add a test case. * Fix lint issues. * Re-organize the tensorize test cases, and add a new case for non-reset mode. * Fix a typo. * Delete the unit test case because it was already merged into test_schedule_tensorize.py. * always use new tensor in its stage when rewriting for cache read * revert previous changes to sync up with master * support using the ptr with an original offset * update test case and fix CI error
-
- Jan 25, 2018
-
-
Zhixun Tan authored
* OpenGL texture is always 1024 x nrows. * Address review comments.
-
- Jan 24, 2018
-
-
Tianqi Chen authored
-
libing4752 authored
* modified schedule_dataflow_rewrite.cc to fix losing tensor problem * modified schedule_dataflow_rewrite.cc for lint scan * modified schedule_dataflow_rewrite.cc for lint scan * using tensor's value_index to index output of stage op * repair address offset for different kinds of dtype * bc * aaa * aaaaa * repair address for different dtypes * remove nonsense files * add whitespace of line 581 * use base alloc elem_type * enhance the test case for basic buffers of 64 bits, 32 bits, 16 bits, and 8 bits * use extends[0]->type() as dtype of offset * clear program writes
-
- Jan 23, 2018
-
-
xqdan authored
-
Tianqi Chen authored
-
Siju Samuel authored
The compilation warning is fixed.
src/runtime/graph/graph_runtime.cc:392:24: warning: comparison between signed and unsigned integer expressions [-Wsign-compare]
  CHECK(data_byte_size == size)
        ~~~~~~~~~~~~~~~^~~~
/mnt/D_DRIVE/work/nnvm_22_Jan/nnvm_latest/tvm/dmlc-core/include/dmlc/logging.h:109:9: note: in definition of macro ‘CHECK’
  if (!(x)) \
        ^
-
- Jan 22, 2018
-
-
Siju Samuel authored
This compilation warning is fixed.
src/pass/inject_virtual_thread.cc:43:19: warning: ‘rw_mask’ may be used uninitialized in this function [-Wmaybe-uninitialized]
      if (rw_mask & 2) {
          ~~~~~~~~^~~
-
- Jan 20, 2018
-
-
Zhixun Tan authored
Basic WebGL Backend
-
- Jan 19, 2018
-
-
Jammy Zhou authored
* Add Mali target support to tvm.target.create * Add Mali target support in codegen
-
- Jan 16, 2018
-
-
Lianmin Zheng authored
* support more argument types in depthwise_conv2d * mark all pointers as 'restrict' & fix vector conversion for opencl
-
- Jan 12, 2018
-
-
Tianqi Chen authored
* [LLVM] Enable same target option in JITModule * do not set mcpu explicitly
-
- Jan 11, 2018
-
-
Yuwei Hu authored
-
- Jan 10, 2018
-
-
Tianqi Chen authored
-
- Jan 09, 2018
-
-
Tianqi Chen authored
* [PASS] Improve loop partition to remove unnecessary warning. * fix comment
-
- Jan 08, 2018
-
-
Tianqi Chen authored
* [PASS] StorageRewrite Fold Inplace op storage when possible * update comment to fix typos
-
- Jan 07, 2018
-
-
xqdan authored
* [SCHEDULE] enable partition const loop with build flag (#719) * enable partition loop with build flag * add a testcase, and modify LoopPartition related cases * add document for split_const_loop * [IRbuild] Support automatically Name Loop Variable in IRBuilder (#719) * add idx_num in class * using typical index [i, j, k] first, then i_suffix * keep inputs names * fix lint * improve comment of name * fix lint * [SCHEDULE] Improve bound deduce for loop partition (#743) * add divided checking when deducing * related testcase * fix * transform LE and GE first * remove is_equal * modify testcase for edge cases checking * fix comment * fix lint * apply transformation from LT -> LE, GT -> GE * fix lint * simplify code and testcase * add negative coefficient case * More complicated cases * add testcase * simplify testcase * comment case for now * fix testcase
-
- Jan 04, 2018
-
-
Tianqi Chen authored
* [CODEGEN] use charp for voidp * fx
-
- Jan 03, 2018
-
-
masahi authored
* rocblas integration * fix include * fix lint
-
libing4752 authored
* modified schedule_dataflow_rewrite.cc to fix losing tensor problem * modified schedule_dataflow_rewrite.cc for lint scan * modified schedule_dataflow_rewrite.cc for lint scan * using tensor's value_index to index output of stage op
-
Lianmin Zheng authored
* [CODEGEN] update codegen for vector operation * update comment, fix for metal * fix some bugs in codegen * use 'restrict' in every argument * fix * fix
-
- Jan 02, 2018
-
-
masahi authored
* add cublas support * integrate cublas to topi dense * add cublas error check * minor fix * fix lint * remove topi import from contrib unittest
-
- Dec 29, 2017
- Dec 27, 2017
-
-
kun-zh authored
* when there is no intrin func, use the body for initialization. For issue 714. * Refine code per review comments, and add a test case. * Fix lint issues.
-
masahi authored
* add target.libs to target str representation * integrate cudnn into topi cuda * append target.libs to target.options
-
- Dec 26, 2017
-
-
masahi authored
* add extern schedule for miopen * fix comment * optionally dispatch to miopen from topi * fix lint * check if current target is None * use generic dispatch for rocm conv2d * fix lint * fix workspace bug * remove blank line * remove blank line * remove blank line
-