- Nov 18, 2017
-
-
Lianmin Zheng authored
-
- Nov 16, 2017
-
-
haolongzhangm authored
some host opencl runtime may at cpu mode, but remote client opencl runtime at gpu mode, compat it
-
Leyuan Wang authored
* scheduler tweaked for super resolution perf * lint error fixed * lint error fixed * conv2d_transpose schedule error fixed
-
- Nov 14, 2017
-
-
Tianqi Chen authored
-
ziheng authored
* [TOPI] Add out_dtype argument for conv2d; Add x86 schedules * Fix * Fix lint * Fix
-
Tianqi Chen authored
-
Leyuan Wang authored
* conv2d perf improved for conv2d_56_64_128, test name added to differentiate workloads * fix lint error
-
- Nov 13, 2017
-
-
abergeron authored
* Make the tvm conda package build with in-place source and use cmake from conda. * Add a package for topi.
-
Tianqi Chen authored
-
- Nov 12, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
-
Lianmin Zheng authored
-
- Nov 11, 2017
-
-
Tianqi Chen authored
* [PASS] Enhance LiftAttrScope * update vt
-
ziheng authored
-
- Nov 09, 2017
- Nov 08, 2017
-
-
Erwan BERNARD authored
* feat(docs) add how_to for tvm install with nnpack support * feat(docs) change python package paragraph * feat(doc) remove unsure sentence * add comments on nnpack usage vs TVM * remove mxnet nnpack tips for nthread change
-
eqy authored
* Support vector operations for AMD (llvm IR) * fix whitespace * update comments, docstring
-
Leyuan Wang authored
-
- Nov 07, 2017
-
-
eqy authored
Change minimum 32-bit restriction for floating point types to 8-bit. This change is to enable reduced precision types that may use vector operations underneath the hood (cases #lanes > 1 such as half4).
-
- Nov 06, 2017
- Nov 03, 2017
-
-
Tianqi Chen authored
-
Yuwei Hu authored
-
- Nov 02, 2017
-
-
Yuwei Hu authored
* enable popcount intrin * fix lint * add test * fix python3
-
- Nov 01, 2017
-
-
Cyril Lashkevich authored
-
- Oct 30, 2017
-
-
Leyuan Wang authored
-
- Oct 27, 2017
-
-
Tianqi Chen authored
-
- Oct 26, 2017
-
-
masahi authored
-
masahi authored
* removed fma dispatch * added comments to explain why remove fma * fix lint * use fmuladd intrin for fma dispatch
-
masahi authored
* view llvm ir and gcn asm with module.get_source(...) * fix lint
-
Tianqi Chen authored
* [BUFFER] Smarter slice to detect compactness * move simplify of begins early
-
- Oct 25, 2017
-
-
Yuwei Hu authored
-
- Oct 24, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
-
Wei Chen authored
-
- Oct 23, 2017
-
-
masahi authored
* update topi/cuda schedules to use target.max_num_threads * allow num_thread to be larger than cuda.max_num_threads * remove get_max_num_threads and make it inline
-
- Oct 22, 2017
-
-
Tianqi Chen authored
-
Hu Shiwen authored
* add friendly tips when not found cl and link * fix lint
-
Wei Chen authored
-