- Nov 12, 2017
-
-
Tianqi Chen authored
-
Lianmin Zheng authored
-
- Nov 11, 2017
-
-
Tianqi Chen authored
* [PASS] Enhance LiftAttrScope * update vt
-
ziheng authored
-
- Nov 09, 2017
- Nov 08, 2017
-
-
Erwan BERNARD authored
* feat(docs) add how_to for tvm install with nnpack support * feat(docs) change python package paragraph * feat(doc) remove unsure sentence * add comments on nnpack usage vs TVM * remove mxnet nnpack tips for nthread change
-
eqy authored
* Support vector operations for AMD (llvm IR) * fix whitespace * update comments, docstring
-
Leyuan Wang authored
-
- Nov 07, 2017
-
-
eqy authored
Change minimum 32-bit restriction for floating point types to 8-bit. This change is to enable reduced precision types that may use vector operations underneath the hood (cases #lanes > 1 such as half4).
-
- Nov 06, 2017
- Nov 03, 2017
-
-
Tianqi Chen authored
-
Yuwei Hu authored
-
- Nov 02, 2017
-
-
Yuwei Hu authored
* enable popcount intrin * fix lint * add test * fix python3
-
- Nov 01, 2017
-
-
Cyril Lashkevich authored
-
- Oct 30, 2017
-
-
Leyuan Wang authored
-
- Oct 27, 2017
-
-
Tianqi Chen authored
-
- Oct 26, 2017
-
-
masahi authored
-
masahi authored
* removed fma dispatch * added comments to explain why remove fma * fix lint * use fmuladd intrin for fma dispatch
-
masahi authored
* view llvm ir and gcn asm with module.get_source(...) * fix lint
-
Tianqi Chen authored
* [BUFFER] Smarter slice to detect compactness * move simplify of begins early
-
- Oct 25, 2017
-
-
Yuwei Hu authored
-
- Oct 24, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
-
Wei Chen authored
-
- Oct 23, 2017
-
-
masahi authored
* update topi/cuda schedules to use target.max_num_threads * allow num_thread to be larger than cuda.max_num_threads * remove get_max_num_threads and make it inline
-
- Oct 22, 2017
-
-
Tianqi Chen authored
-
Hu Shiwen authored
* add friendly tips when not found cl and link * fix lint
-
Wei Chen authored
-
- Oct 20, 2017
-
-
masahi authored
* added math function support * bug fix extern func call in llvm based codegen lint fix fix build bug fix extern func call in llvm based codegen * moved rocm bitcodes detection to python
-
- Oct 19, 2017
-
-
Wei Chen authored
use `object.__eq__`(default object identity comparison) as default implementation of same_as. This should be OK since `EqualOp` and `NotEqualOp` are pure Python object, `object.__eq__` is sufficient.
-
- Oct 17, 2017
-
-
Tianqi Chen authored
* [PYTHON] Improve equal sugar * fix comment
-
Tianqi Chen authored
-
- Oct 16, 2017
-
-
Tianqi Chen authored
* [ARITH] More caninical simplfy * [DEBUG] Use HalideIR with trace logging
-
ziheng authored
* [FIX] Fix target warning * [FIX] Deduplicate options * Fix * Fix
-
Tianqi Chen authored
* [CODEGEN] Allow link additional module * fix py3 * add register back
-
- Oct 15, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
* [CODEGEN] Force not inline compute core for better debug * also support llvm4
-
- Oct 14, 2017
-
-
Tianqi Chen authored
* [TVM] Introduce target generic dispatch system * fix target warning
-