Commits · 3479b9ab75fc62be007d259a35a0ff82df1c7fcb · cld / ml / tvm

Nov 18, 2017
- [RUNTIME] support limited save without cross compile (#659) · 3479b9ab
  Lianmin Zheng authored 7 years ago
  
  3479b9ab
Nov 16, 2017

Compat for opencl mode between cpu mode and gpu mode (#655) · db743028
haolongzhangm authored 7 years ago
```
some host opencl runtime may at cpu mode, but remote

client opencl runtime at gpu mode, compat it
```
db743028

Conv2d scheduler tweaked for super resolution perf (#652) · 7d620be4

Leyuan Wang authored 7 years ago

* scheduler tweaked for super resolution perf

* lint error fixed

* lint error fixed

* conv2d_transpose schedule error fixed

7d620be4

Nov 14, 2017
- [UNROLL] New unroll option (#647) · a2aa154c
  Tianqi Chen authored 7 years ago
  
  a2aa154c
- [TOPI] Add out_dtype argument for conv2d; Add x86 schedules (#646) · c6a1241e
  ziheng authored 7 years ago
  
  * [TOPI] Add out_dtype argument for conv2d; Add x86 schedules * Fix * Fix lint * Fix
  c6a1241e
- [APP] improve parameter pack (#645) · d7354628
  Tianqi Chen authored 7 years ago
  
  d7354628
- conv2d perf improved for conv2d_56_64_128, super resolution workloads added (#643) · afc693dc
  Leyuan Wang authored 7 years ago
  
  * conv2d perf improved for conv2d_56_64_128, test name added to differentiate workloads * fix lint error
  afc693dc
Nov 13, 2017
- Fix conda packages (#642) · a908b831
  abergeron authored 7 years ago
  
  * Make the tvm conda package build with in-place source and use cmake from conda. * Add a package for topi.
  a908b831
- [PASS] Fix vthread when extern access touching (#636) · 4d2fc952
  Tianqi Chen authored 7 years ago
  
  4d2fc952
Nov 12, 2017
- [CODEGEN] Enable closure with no argument (#635) · b07ceff5
  Tianqi Chen authored 7 years ago
  
  b07ceff5
- [PASS] Update coproc sync (#634) · f1aabedc
  Tianqi Chen authored 7 years ago
  
  f1aabedc
- [TUTORIAL] use OpenCL on ARM board (#633) · 32b0fff2
  Lianmin Zheng authored 7 years ago
  
  32b0fff2
Nov 11, 2017
- [PASS] Enhance LiftAttrScope (#632) · e4b40b53
  Tianqi Chen authored 7 years ago
  
  * [PASS] Enhance LiftAttrScope * update vt
  e4b40b53
- [NNPACK] Add argument nthreads (#631) · 182a7852
  ziheng authored 7 years ago
  
  182a7852
Nov 09, 2017
- android gemm for topi/recipe (#628) · 35485307
  Yizhi Liu authored 7 years ago
  
  35485307
- inline AMD GPU functions (#625) · 8fea0879
  eqy authored 7 years ago
  
  * Support vector operations for AMD (llvm IR) * fix whitespace * update comments, docstring * inline AMD GPU functions
  8fea0879
Nov 08, 2017

WIP: Add how_to readme to install tvm with nnpack support (#610) · 90067e64

Erwan BERNARD authored 7 years ago

* feat(docs) add how_to for tvm install with nnpack support

* feat(docs) change python package paragraph

* feat(doc) remove unsure sentence

* add comments on nnpack usage vs TVM

* remove mxnet nnpack tips for nthread change

90067e64

Support vector operations for AMD (llvm IR) (#623) · cedd3900

eqy authored 7 years ago

* Support vector operations for AMD (llvm IR)

* fix whitespace

* update comments, docstring

cedd3900

conv2d_56_64_128 mark==1 bug fixed (#624) · 25847a4f
Leyuan Wang authored 7 years ago

25847a4f

Nov 07, 2017

remove minimum 32-bit restriction (#621) · 08e4d085

eqy authored 7 years ago

Change minimum 32-bit restriction for floating point types to 8-bit.
This change is to enable reduced precision types that may use vector operations underneath the hood (cases #lanes > 1 such as half4).

08e4d085

Nov 06, 2017
- add tanh dispatch (#619) · c7101537
  masahi authored 7 years ago
  
  c7101537
- [TOPI] fix weight layout in conv2d_transpose (#616) · c1008ec4
  Yuwei Hu authored 7 years ago
  
  c1008ec4
Nov 03, 2017
- [DLPack] Upgrade dlpack to 0.2 (#609) · 8214d6ca
  Tianqi Chen authored 7 years ago
  
  8214d6ca
- [TOPI] modify conv2d_transpose schedule (#613) · a152a9cb
  Yuwei Hu authored 7 years ago
  
  a152a9cb
Nov 02, 2017
- [INTRIN] Enable popcount (#606) · 685f78d0
  Yuwei Hu authored 7 years ago
  
  * enable popcount intrin * fix lint * add test * fix python3
  685f78d0
Nov 01, 2017
- Fixed build with metal on MacOS with case-sensitive FS (#601) · 3bb2eef5
  Cyril Lashkevich authored 7 years ago
  
  3bb2eef5
Oct 30, 2017
- vgg16 workload error fixed (#598) · 3c895464
  Leyuan Wang authored 7 years ago
  
  3c895464
Oct 27, 2017
- [TOPI] Support ceil_mode in pooling (#593) · 88662130
  Tianqi Chen authored 7 years ago
  
  88662130
Oct 26, 2017
- add helpful message to topi test (#592) · 2f2170f4
  masahi authored 7 years ago
  
  2f2170f4
- [ROCM] remove fma dispatch (#591) · 20144de2
  masahi authored 7 years ago
  
  * removed fma dispatch * added comments to explain why remove fma * fix lint * use fmuladd intrin for fma dispatch
  20144de2
- [ROCM] View llvm ir and gcn asm with module.get_source(...) (#590) · 6a5d6165
  masahi authored 7 years ago
  
  * view llvm ir and gcn asm with module.get_source(...) * fix lint
  6a5d6165
- [BUFFER] Smarter slice to detect compactness (#587) · a76851d7
  Tianqi Chen authored 7 years ago
  
  * [BUFFER] Smarter slice to detect compactness * move simplify of begins early
  a76851d7
Oct 25, 2017
- [TOPI] add conv2d_transpose_nchw (#586) · 5f79521b
  Yuwei Hu authored 7 years ago
  
  5f79521b
Oct 24, 2017
- [PYTHON] Allow no de-allocation when exit (#583) · 25f95766
  Tianqi Chen authored 7 years ago
  
  25f95766
- [CODEGEN] Fix CPU compute attribute (#582) · da27cfec
  Tianqi Chen authored 7 years ago
  
  da27cfec
- [DOCS] Fix tag_scope example (#581) · 18e4a1bd
  Wei Chen authored 7 years ago
  
  18e4a1bd
Oct 23, 2017

Update topi/cuda schedules to use target.max_num_threads (#577) · 12218358

masahi authored 7 years ago

* update topi/cuda schedules to use target.max_num_threads

* allow num_thread to be larger than cuda.max_num_threads

* remove get_max_num_threads and make it inline

12218358

Oct 22, 2017
- [PASS] More robust UnrollLoop configuratin (#576) · 0f1e0ff0
  Tianqi Chen authored 7 years ago
  
  0f1e0ff0
- add friendly tips when not found cl and link (#574) · 69759c0c
  Hu Shiwen authored 7 years ago
  
  * add friendly tips when not found cl and link * fix lint
  69759c0c
- [SCHEDULE] Detect duplicate IterVar in reorder (#575) · 1791b121
  Wei Chen authored 7 years ago
  
  1791b121