- Oct 22, 2017
-
-
Tianqi Chen authored
-
Wei Chen authored
-
- Oct 20, 2017
-
-
masahi authored
* added math function support * bug fix extern func call in llvm based codegen lint fix fix build bug fix extern func call in llvm based codegen * moved rocm bitcodes detection to python
-
- Oct 17, 2017
-
-
Tianqi Chen authored
-
- Oct 16, 2017
-
-
Tianqi Chen authored
* [ARITH] More caninical simplfy * [DEBUG] Use HalideIR with trace logging
-
Tianqi Chen authored
* [CODEGEN] Allow link additional module * fix py3 * add register back
-
- Oct 15, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
* [CODEGEN] Force not inline compute core for better debug * also support llvm4
-
- Oct 14, 2017
-
-
Tianqi Chen authored
* [TVM] Introduce target generic dispatch system * fix target warning
-
ziheng authored
* [CODEGEN] Detect broadcast(cast(x)) pattern in FMA * [CODEGEN] Improve * [CODEGEN] Fix
-
- Oct 13, 2017
-
-
Aditya Atluri authored
* added support for rocm gpu autodetect * changed type casting from old style to static_cast * fixed code to generate gfx specific code object * fixed namespaces
-
Hu Shiwen authored
-
Tianqi Chen authored
-
- Oct 12, 2017
-
-
masahi authored
-
Tianqi Chen authored
* [RUNTIME] Enable ext_dev type for quick plugin of device * [TEST] Update testcase to cover all computation
-
- Oct 11, 2017
-
-
Tianqi Chen authored
* [PASS] copy intrin * update comment thanks to derisavi
-
- Oct 10, 2017
-
-
Tianqi Chen authored
* [ARITH] Improve detect linear equation * fix doc
-
- Oct 08, 2017
-
-
Tianqi Chen authored
-
- Oct 05, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
-
- Oct 04, 2017
-
-
Tianqi Chen authored
-
- Sep 26, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
-
- Sep 25, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
-
Tianqi Chen authored
* [RUNTIME] Minimum graph runtime * update docs
-
- Sep 22, 2017
-
-
Tianqi Chen authored
* [INTRIN] Enable pow * rename pow->power * fix
-
Tianqi Chen authored
-
- Sep 20, 2017
-
-
Tianqi Chen authored
-
Tianqi Chen authored
* [CODEGEN] Redo CodegenLLVM. * Add remarks about origin of the pass Properly acknowledge related projects * Fix and expression
-
- Sep 18, 2017
-
-
Tianqi Chen authored
* [METAL] use 32bit indexing for metal until we have a bound adapted pass * fix lint
-
Tianqi Chen authored
* [RPC] Expose module handle * not include handle
-
- Sep 17, 2017
-
-
Tianqi Chen authored
* [PASS] Fix intrinsic lowering with fma and other intrin * relax rtol for sqrt
-
- Sep 13, 2017
-
-
Aditya Atluri authored
* added initial llvm codegen for amdgpu * fixed whitespace * fixed hsaco gen from ir * fixed targetmachine for rocm and added GetSource for rocm * fixed whitespace issues * changed statement to use less than 100 lines * added intrinsics for workgroup - rocm * whitespace - newline error fix * fixed error msg for workitem-workgroup intrinsics * added llvm ir dump for rocm codegen * [ROCM] changed codegen to emit proper amdgpu kernel header * fixed whitespace error * fixed whitespace error- 2 * fixed AddFunction to not to use extra arg 1. Changed AddFunctionInternal to not to take extra arg for target type 2. Use Target from CodeGenLLVM to check for AMDGPU target * fixed whitespaces * fixed whitespaces 2 * fixed codegen for AMDGPU - now generating valid IR * fixed codegen depending on code review * reviewed alignment for amd devices * added code to dump code object to file * fixed cpplint errors * print out IR after pass manager * added code to dump asm, obj to file and std string * fixed whitespaces * Update codegen_amdgpu.cc * used registry for amdgpu llvm * Fixed whitespaces * added code for calling linker * fixed formatting errors * added rocm link python interface * fixed pylint issues and added more body to the function * added doc string * added doc string for module * fixed python code after review, fixed llvm object codegen * fixed linker to generate code object * removed dumping to output file and debugging log out * fixed lint for python code * added fault check after running linker * removed print statement in rocm.py * changed rocm lld linker to raise runtimeerror than emitting error log to stderr * changed the way linker command line is pass to subprocess.popen * removed redundant code and reuse tvm utils * removed commented out code * removed cloning of unused modules, and put IR into string
-
Tianqi Chen authored
-
- Sep 12, 2017
-
-
Shuai Yuan authored
Clarify confusing error message for unmatched context
-
Tianqi Chen authored
* [RUNTIME] Enable extension type to PackedFunc. * More comments
-
- Sep 11, 2017
-
-
Tianqi Chen authored
* [RUNTIME][RPC] Enable remote linking of device code. * fix build
-
- Sep 09, 2017
-
-
Tianqi Chen authored
-
- Sep 07, 2017
-
-
Tianqi Chen authored
* [SCHEDULE] Enahance cache_write to enable layout change. * more tests
-