  1. Sep 20, 2017
  2. Sep 19, 2017
  3. Sep 18, 2017
  4. Sep 17, 2017
  5. Sep 14, 2017
  6. Sep 13, 2017
    • Aditya Atluri's avatar
      [BACKEND] initial llvm codegen for amdgpu (#402) · 891e226b
      Aditya Atluri authored
      * added initial llvm codegen for amdgpu
      
      * fixed whitespace
      
      * fixed hsaco gen from ir
      
      * fixed targetmachine for rocm and added GetSource for rocm
      
      * fixed whitespace issues
      
      * changed statement to use less than 100 lines
      
      * added intrinsics for workgroup - rocm
      
      * whitespace - newline error fix
      
      * fixed error msg for workitem-workgroup intrinsics
      
      * added llvm ir dump for rocm codegen
      
      * [ROCM] changed codegen to emit proper amdgpu kernel header
      
      * fixed whitespace error
      
      * fixed whitespace error- 2
      
      * fixed AddFunction to not use an extra arg
      
      1. Changed AddFunctionInternal to not take an extra arg for target type
      2. Use Target from CodeGenLLVM to check for AMDGPU target
      
      * fixed whitespaces
      
      * fixed whitespaces 2
      
      * fixed codegen for AMDGPU - now generating valid IR
      
      * fixed codegen depending on code review
      
      * reviewed alignment for amd devices
      
      * added code to dump code object to file
      
      * fixed cpplint errors
      
      * print out IR after pass manager
      
      * added code to dump asm, obj to file and std string
      
      * fixed whitespaces
      
      * Update codegen_amdgpu.cc
      
      * used registry for amdgpu llvm
      
      * Fixed whitespaces
      
      * added code for calling linker
      
      * fixed formatting errors
      
      * added rocm link python interface
      
      * fixed pylint issues and added more body to the function
      
      * added doc string
      
      * added doc string for module
      
      * fixed python code after review, fixed llvm object codegen
      
      * fixed linker to generate code object
      
      * removed dumping to output file and debugging log out
      
      * fixed lint for python code
      
      * added fault check after running linker
      
      * removed print statement in rocm.py
      
      * changed rocm lld linker to raise RuntimeError rather than emitting an error log to stderr
      
      * changed the way the linker command line is passed to subprocess.Popen
      
      * removed redundant code and reuse tvm utils
      
      * removed commented out code
      
      * removed cloning of unused modules, and put IR into string
    • Tianqi Chen's avatar
      5061a6da
  7. Sep 12, 2017
    • Leyuan Wang's avatar
      769544ad
    • Shuai Yuan's avatar
      [RPC] clarify error message for unmatched context (#451) · 220fa040
      Shuai Yuan authored
      Clarify confusing error message for unmatched context
    • wetliu's avatar
      [TOPI] Depth wise convolution backward methods for NHWC (#434) · ffff1e49
      wetliu authored
      * rename the nchw and pass the unit test; going to do it for nhwc depthwise
      
      * bug with fusion
      
      * nchw works fine; nhwc float32 problem remains
      
      * still cannot bind them together
      
      * fusion works
      
      * syntax fix
      
      * all bugs fixed; test cases pass
      
      * minor fix on nn.h
      
      * back wrt input
      
      * backward wrt input nhwc; only test case in recipe
      
      * test case for depthwise back wrt input
      
      * test case for depthwise backward wrt weight
      
      * tags
      
      * minor fixes
      
      * pylint test; add arch=3.7
      
      * modify scheduler
      
      * better backward depthwise w.r.t weight scheduler
      
      * updated scheduler
      
      * test_topi_depthwise_conv2d_back_input.py and test_topi_depthwise_conv2d_back_weight.py success
      
      * all test cases wrt input pass
      
      * update
      
      * new test cases and scheduler
      
      * not working 1 and 2
      
      * good wrt weight, bad wrt input
      
      * test cases added
      
      * remove tf lines
      
      * minor fix
      
      * compute arch changed
      
      * remove compile hook
      
      * minor change
      
      * pylint
      
      * fix the float for python case
      
      * fix cases for python3 case
      
      * except for memoize
      
      * fix most; memoize still wrong
      
      * memoize added
      
      * unexpected layout cases added for scheduler
      
      * error message layout other than NHWC added
      
      * improve padding
      
      * fix as pr requests
      
      * remove dilate in backward wrt weight
    • Tianqi Chen's avatar
      [RUNTIME] Enable extension type to PackedFunc. (#447) · f2ab736b
      Tianqi Chen authored
      * [RUNTIME] Enable extension type to PackedFunc.
      
      * More comments
  8. Sep 11, 2017
    • Shuai Yuan's avatar
      [DOCS] Add prerequisites about zlib1g-dev (#446) · 3130f2d5
      Shuai Yuan authored
      * [DOCS] Add prerequisites about zlib1g-dev
      
      Add zlib1g-dev to the prerequisites. Without it, linking fails with `/usr/bin/ld: cannot find -lz`.
      
      * Add prerequisites about python-setuptools
      
      Add python-setuptools to the prerequisites. Without it, the `python setup.py install --user` command fails.
      
      * [DOCS] Add prerequisites about python-dev
      
      Add python-dev to the installation prerequisites. Without it, `python setup.py install --user` fails with `SystemError: Cannot compile 'Python.h'. Perhaps you need to install python-dev|python-devel.`
    • Tianqi Chen's avatar
      [RUNTIME][RPC] Enable remote linking of device code. (#444) · 1077f8e8
      Tianqi Chen authored
      * [RUNTIME][RPC] Enable remote linking of device code.
      
      * fix build
  9. Sep 10, 2017
  10. Sep 09, 2017
  11. Sep 08, 2017
    • Leyuan Wang's avatar
      Conv2d updated (#435) · edca6830
      Leyuan Wang authored
      * improved conv2d for last group of workloads
      
      * conv2d_nchw improved on 14_256_256 and 56_64_128
  12. Sep 07, 2017
  13. Sep 06, 2017
  14. Sep 05, 2017
  15. Sep 04, 2017
  16. Sep 03, 2017
  17. Sep 01, 2017