- Nov 30, 2017
-
-
Salem Derisavi authored
-
Tianqi Chen authored
* [CUDA] Enable int64 * [PYTHON] Fix rpc tutorial with opencl * OK * update
-
- Nov 29, 2017
-
-
Tianqi Chen authored
* [RPC][JVM] Remove binary dist gradle from repo * fix header
-
- Nov 28, 2017
-
-
Tianqi Chen authored
-
- Nov 25, 2017
-
-
Tianqi Chen authored
* [PASS] Allow compact checking when strides is available * remove assert compact
-
- Nov 23, 2017
-
-
Siva authored
Readability.
-
- Nov 21, 2017
-
-
Tianqi Chen authored
* [PASS/SETUP] Fix minor issues * fix lint
-
Sheng Zha authored
* mps * update
-
- Nov 18, 2017
-
-
Lianmin Zheng authored
-
- Nov 16, 2017
-
-
haolongzhangm authored
Some host OpenCL runtimes may run in CPU mode while the remote client OpenCL runtime runs in GPU mode; make them compatible.
-
- Nov 14, 2017
-
-
Tianqi Chen authored
-
- Nov 13, 2017
-
-
Tianqi Chen authored
-
- Nov 12, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
-
- Nov 11, 2017
-
-
Tianqi Chen authored
* [PASS] Enhance LiftAttrScope * update vt
-
ziheng authored
-
- Nov 09, 2017
-
-
eqy authored
* Support vector operations for AMD (llvm IR) * fix whitespace * update comments, docstring * inline AMD GPU functions
-
- Nov 08, 2017
-
-
eqy authored
* Support vector operations for AMD (llvm IR) * fix whitespace * update comments, docstring
-
- Nov 07, 2017
-
-
eqy authored
Change the minimum 32-bit restriction for floating point types to 8-bit. This change enables reduced precision types that may use vector operations under the hood (cases where #lanes > 1, such as half4).
-
- Nov 06, 2017
-
-
masahi authored
-
- Nov 03, 2017
-
-
Tianqi Chen authored
-
- Nov 02, 2017
-
-
Yuwei Hu authored
* enable popcount intrin * fix lint * add test * fix python3
-
- Oct 26, 2017
-
-
masahi authored
* removed fma dispatch * added comments to explain why remove fma * fix lint * use fmuladd intrin for fma dispatch
-
masahi authored
* view llvm ir and gcn asm with module.get_source(...) * fix lint
-
Tianqi Chen authored
* [BUFFER] Smarter slice to detect compactness * move simplify of begins early
-
- Oct 24, 2017
-
-
Tianqi Chen authored
-
- Oct 22, 2017
-
-
Tianqi Chen authored
-
Wei Chen authored
-
- Oct 20, 2017
-
-
masahi authored
* added math function support * bug fix extern func call in llvm based codegen * lint fix * fix build * bug fix extern func call in llvm based codegen * moved rocm bitcodes detection to python
-
- Oct 17, 2017
-
-
Tianqi Chen authored
-
- Oct 16, 2017
-
-
Tianqi Chen authored
* [ARITH] More canonical simplify * [DEBUG] Use HalideIR with trace logging
-
Tianqi Chen authored
* [CODEGEN] Allow link additional module * fix py3 * add register back
-
- Oct 15, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
* [CODEGEN] Force not inline compute core for better debug * also support llvm4
-
- Oct 14, 2017
-
-
Tianqi Chen authored
* [TVM] Introduce target generic dispatch system * fix target warning
-
ziheng authored
* [CODEGEN] Detect broadcast(cast(x)) pattern in FMA * [CODEGEN] Improve * [CODEGEN] Fix
-
- Oct 13, 2017
-
-
Aditya Atluri authored
* added support for rocm gpu autodetect * changed type casting from old style to static_cast * fixed code to generate gfx specific code object * fixed namespaces
-
Hu Shiwen authored
-
Tianqi Chen authored
-
- Oct 12, 2017
-
-
masahi authored
-