Releases: tile-ai/tilelang

v0.1.1

23 Feb 09:32
59342bb

What's Changed

  • [Doc] Update release news by @LeiWang1999 in #80
  • [Doc] Convert docs from rst format to Markdown format. by @xwhzz in #82
  • [Bugfix] Bugfix of installing with develop mode by @LeiWang1999 in #81
  • [WHL] Support whl building for different python versions via tox by @LeiWang1999 in #83
  • [Refactor] Separate tilelang Pass Thread Sync (with Hopper support) from tvm by @LeiWang1999 in #85
  • [Backend][WebGPU] Support WebGPU WGSL code generation by @LeiWang1999 in #86
  • [Wheel] Support pypi build scripts for different python via tox by @LeiWang1999 in #93
  • [Wrap] Use a ctypes-based kernel wrapper instead of dlpack for runtime efficiency by @LeiWang1999 in #95
  • [Bugfix] Update Dockerfile.cu120 by @LeiWang1999 in #98
  • [Bugfix] Put InjectPtxAsyncCopy Pass behind ThreadSync Pass by @LeiWang1999 in #97
  • [Feature] Add CTypes JIT kernel support by @LeiWang1999 in #100
  • [Docker] Add Dockerfiles for multiple CUDA versions by @LeiWang1999 in #103
  • [JIT] Support Cython jit and make cython a default execution backend by @LeiWang1999 in #102
  • [Refactor] Phase out torch cpp extension backend by @LeiWang1999 in #104
  • [Wheel] Provide bare docker scripts to help build wheels for manylinux by @LeiWang1999 in #105
  • [Example] Implement simple block sparse kernel by @LeiWang1999 in #106
  • [Release] Bump version to v0.1.1 by @LeiWang1999 in #107

Full Changelog: v0.1.0...v0.1.1

v0.1.0

12 Feb 06:23
02a2cba

What's Changed

  • [LICENSE] Add LICENSE for flashinfer by @LeiWang1999 in #19
  • [Doc] Fix installation scripts and docs for dequantize gemm by @LeiWang1999 in #20
  • [Doc] Use sphinx to generate docs. by @xwhzz in #21
  • [Doc] update installation.md and readme by @Cunxiao2002 in #22
  • [Doc] fix a typo in installation.rst by @Cunxiao2002 in #24
  • [Doc] Remove legacy files and update reference by @LeiWang1999 in #25
  • [CI][Test] Add test cases for tilelang transform AnnotateDeviceRegions and MakePackedAPI by @LeiWang1999 in #26
  • [Doc] Create a workflow to host docs using GitHub Pages. by @xwhzz in #28
  • [CI][Test] Add test cases for tilelang transform InjectSoftwarePipeline and FrontendLegalize by @Cunxiao2002 in #30
  • [Bugfix] Replace thread binding detector in LayoutInference Pass by @LeiWang1999 in #31
  • [CI] Comprehensive Test cases Implementation of Matmul Dequantize by @LeiWang1999 in #32
  • [Doc] Update GitHub Actions workflow for documentation deployment and add CNAME file. by @xwhzz in #33
  • [Refactor] Simplify interface via replacing argument thread binding of intrinsics with KernelFrame.Current by @LeiWang1999 in #34
  • [Bugfix] Reorder Passes: Place Vectorize Loop Before StorageFlatten and FlattenBuffer to Prevent Redundant Allocations by @LeiWang1999 in #37
  • [Doc] Update documentation structure and content by @LeiWang1999 in #39
  • [Doc][CI] Update GitHub Actions workflow for documentation build and deployment. by @xwhzz in #42
  • [CI] Allow manual triggering of documentation workflow in addition to… by @xwhzz in #43
  • [CI][Test] Add test cases for tilelang transform PipelinePlanning by @Cunxiao2002 in #44
  • [CI][Test] Add test cases for tilelang transform LayoutInference and LowerTileOp on loop tail split functionality by @tzj-fxz in #29
  • [Debug] Introduce T.print for buffer and variables logging on frontend by @LeiWang1999 in #45
  • [CI] Change pull request trigger to pull_request_target for documen… by @xwhzz in #48
  • [Dev] Add FlashDecoding example by @chengyupku in #46
  • [Doc] update README that tilelang has been used in AttentionEngine by @smallscientist1 in #50
  • [Doc] Remove unnecessary layout annotation by @LeiWang1999 in #49
  • [CI][Test] Add test cases for tilelang kernel convolution by @chengyupku in #51
  • [Dev] Implement test case for tilelang transformations by @LeiWang1999 in #53
  • [CI][Test] Add test cases for tilelang kernel FlashAttention by @chengyupku in #54
  • [CI][Test] Add test cases for element_add by @Cunxiao2002 in #47
  • [CI] Clean up target repository before publishing documentation. by @xwhzz in #55
  • [CI][Test] Add test cases for tilelang transform ClusterPlanning by @chengyupku in #57
  • [Doc] Append debug relevant testing and documentations by @LeiWang1999 in #58
  • [CI][Test] Add test cases for tilelang transform LowerHopperIntrin by @chengyupku in #59
  • [Doc] Add matmul kernel tutorial with tile library by @LeiWang1999 in #60
  • [Dev] Separate LoopVectorize Pass from upstream tvm by @LeiWang1999 in #62
  • [Dev] Support FP8 Codegen for cuda backend by @LeiWang1999 in #64
  • [Dev] Add test case for bfloat16 and int4 gemm with mma by @LeiWang1999 in #65
  • [CI][Test] Add test cases for tilelang transform InjectFenceProxy by @chengyupku in #66
  • [Tools] Introduce plot_layout to visualize the fragment layout by @LeiWang1999 in #68
  • [Dev] Remove unnecessary python dependencies by @LeiWang1999 in #69
  • [Carver] Introduce a tile-structure based cost model for auto tuning by @LeiWang1999 in #70
  • [Bugfix] bug fix for bitblas dependency by @LeiWang1999 in #71
  • [CI][Test] Add test cases for tilelang transform MultiVersionBuffer and WarpSpecialized by @chengyupku in #72
  • [CostModel][Carver] Support Hint Recommend for Shared memory Kernel Fusion by @LeiWang1999 in #73
  • [Carver] Remove legacy todo items in carver's readme by @LeiWang1999 in #74
  • [Dev] Add mha backward example by @chengyupku in #77
  • [Release] Bump version to v0.1.0 by @LeiWang1999 in #76

Full Changelog: v0.0.1...v0.1.0

TileLang v0.0.1 Pre-release

20 Jan 12:54
473977b

Pre-release of v0.0.1. Still under testing; only prebuilt CUDA builds are provided.

Full Changelog: https://github.com/tile-ai/tilelang/commits/v0.0.1