- Dec 13, 2017
-
-
Salem Derisavi authored
* Simplify expressions early on * fixed lint errors
-
Salem Derisavi authored
* 1) Refactored some parts of the unrolling code into their own methods so we can reuse unrolling functionality in other parts of the code. E.g., to explicitly unroll loops with count of 1 when they are programmatically created. 2) Reorder based on top operator before resorting to pointers, which causes non-determinism. * Fixed lint errors
-
- Dec 11, 2017
-
-
abergeron authored
* Use long long for platforms where long is 32 bits (like windows). * Make sure scalar chars are signed. * Re-add NOLINT marker.
-
Lianmin Zheng authored
* [CODEGEN] add fp16 and fp64 enable pragma for opencl * fix style
-
- Dec 07, 2017
-
-
Lianmin Zheng authored
-
- Dec 05, 2017
-
-
alex-weaver authored
* Port build_module.py to C++ * Fix lint errors * Fix more lint errors * Fix more lint errors * Fix more lint errors * Fix build error * Implemented style fixes * Fix lint errors * Added function to construct target from string lower now returns array * Fix lint error * Implemented review changes - style & Target options -> std::vector * Fixed lint, argument alignment and added unit test * Changed test to target LLVM, fixed sign compare warnings * Reverted unit test to CUDA, changed Jenkinsfile to enable GPU for C++ tests * Slight change to Jenkinsfile * Changed build_module test from CUDA to LLVM * Added function var() to construct a Var instance. Changed implementation of LLVMEnabled() * Reverted Jenkinsfile
-
- Dec 04, 2017
-
-
Tianqi Chen authored
* Support rank-0 tensor * fix lint
-
- Dec 01, 2017
-
-
ziheng authored
* [RANDOM] Init contrib.random library * [RANDOM] Add uniform * [RANDOM] Fix lint * [RANDOM] Add comments and tests * [RANDOM] Fix lint
-
- Nov 30, 2017
-
-
Salem Derisavi authored
-
Tianqi Chen authored
* [CUDA] Enable int64 * [PYTHON] Fix rpc tutorial with opencl * OK * update
-
- Nov 29, 2017
-
-
Tianqi Chen authored
* [RPC][JVM] Remove binary dist gradle from repo * fix header
-
- Nov 28, 2017
-
-
Tianqi Chen authored
-
- Nov 25, 2017
-
-
Tianqi Chen authored
* [PASS] Allow compact checking when strides is available * remove assert compact
-
- Nov 23, 2017
-
-
Siva authored
Readability.
-
- Nov 21, 2017
-
-
Tianqi Chen authored
* [PASS/SETUP] Fix minior issues * fix lint
-
Sheng Zha authored
* mps * update
-
- Nov 18, 2017
-
-
Lianmin Zheng authored
-
- Nov 16, 2017
-
-
haolongzhangm authored
some host opencl runtime may at cpu mode, but remote client opencl runtime at gpu mode, compat it
-
- Nov 14, 2017
-
-
Tianqi Chen authored
-
- Nov 13, 2017
-
-
Tianqi Chen authored
-
- Nov 12, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
-
- Nov 11, 2017
-
-
Tianqi Chen authored
* [PASS] Enhance LiftAttrScope * update vt
-
ziheng authored
-
- Nov 09, 2017
-
-
eqy authored
* Support vector operations for AMD (llvm IR) * fix whitespace * update comments, docstring * inline AMD GPU functions
-
- Nov 08, 2017
-
-
eqy authored
* Support vector operations for AMD (llvm IR) * fix whitespace * update comments, docstring
-
- Nov 07, 2017
-
-
eqy authored
Change minimum 32-bit restriction for floating point types to 8-bit. This change is to enable reduced precision types that may use vector operations underneath the hood (cases #lanes > 1 such as half4).
-
- Nov 06, 2017
-
-
masahi authored
-
- Nov 03, 2017
-
-
Tianqi Chen authored
-
- Nov 02, 2017
-
-
Yuwei Hu authored
* enable popcount intrin * fix lint * add test * fix python3
-
- Oct 26, 2017
-
-
masahi authored
* removed fma dispatch * added comments to explain why remove fma * fix lint * use fmuladd intrin for fma dispatch
-
masahi authored
* view llvm ir and gcn asm with module.get_source(...) * fix lint
-
Tianqi Chen authored
* [BUFFER] Smarter slice to detect compactness * move simplify of begins early
-
- Oct 24, 2017
-
-
Tianqi Chen authored
-
- Oct 22, 2017
-
-
Tianqi Chen authored
-
Wei Chen authored
-
- Oct 20, 2017
-
-
masahi authored
* added math function support * bug fix extern func call in llvm based codegen lint fix fix build bug fix extern func call in llvm based codegen * moved rocm bitcodes detection to python
-
- Oct 17, 2017
-
-
Tianqi Chen authored
-
- Oct 16, 2017
-
-
Tianqi Chen authored
* [ARITH] More caninical simplfy * [DEBUG] Use HalideIR with trace logging
-
Tianqi Chen authored
* [CODEGEN] Allow link additional module * fix py3 * add register back
-