Tags: tile-ai/tilelang

v0.1.11

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
[Release] Bump version to 0.1.11 (#2354)

Bump version to 0.1.11

v0.1.10

[Bugfix] Fix ROCm FP4 packed buffer map key (#2256)

Fix ROCm FP4 packed buffer map key

v0.1.9

[Refactor] Strip build machine paths from LOG messages in wheel releases (#2080)

* [Refactor] Use TVM_LOG_CUSTOMIZE to strip build paths from LOG messages in release builds

Enable TVM's custom logging hook (TVM_LOG_CUSTOMIZE) and provide our own
LogMessageImpl/LogFatalImpl that conditionally omit source file paths.
In wheel/release builds (detected via CIBUILDWHEEL or SKBUILD_STATE env
vars), LOG(WARNING) and friends no longer leak CI machine paths, showing
only the message. Local dev builds keep full paths for debugging.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* [Refactor] Downgrade noisy LOG(WARNING) to DLOG(WARNING)

These warnings about TMA/swizzle layout fallbacks and warp specialization
status are development diagnostics, not actionable for end users. Use DLOG
so they are compiled out in release (wheel) builds.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* [Chore] Apply clang-format

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
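
The path-stripping behavior described in the first commit above can be illustrated with a small Python analogue. This is only a sketch of the logic, not the C++ logging hook itself: the helper names `is_release_build` and `format_log_line`, the message layout, and the example file path are all hypothetical. Only the environment variable names `CIBUILDWHEEL` and `SKBUILD_STATE` come from the commit message.

```python
import os


def is_release_build(env=None):
    """Return True if either release-detection variable is set.

    Assumption: any non-empty value counts as "set"; the real build-time
    check may be stricter.
    """
    env = os.environ if env is None else env
    return bool(env.get("CIBUILDWHEEL") or env.get("SKBUILD_STATE"))


def format_log_line(level, file, line, message, release):
    """Hypothetical formatter: drop the source location in release builds."""
    if release:
        return f"[{level}] {message}"
    return f"[{level}] {file}:{line}: {message}"


# Made-up path and message, purely for illustration.
print(format_log_line("WARNING", "/home/ci/example.cc", 42, "layout fallback", True))
print(format_log_line("WARNING", "/home/ci/example.cc", 42, "layout fallback", False))
```

The first call prints only the level and message, as a wheel build would; the second keeps the file and line, as a local development build would.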

v0.1.8

[Release] Bump version into v0.1.8 (#1853)

Bump version into 0.1.8

v0.1.7.post3

[Release] Bump Version into v0.1.7.post3 (#1685)

Bump Version into v0.1.7.post3

v0.1.7.post2

[CUDA] Enhance Broadcast Codegen for Symbolic Value (#1669)

* Enhance CUDA code generation for BroadcastNode by implementing compile-time constant folding and runtime broadcasting for various lane configurations. Improved handling for 4-bit and 8-bit integer types, ensuring correct replication and type casting in output expressions. This update increases performance and correctness in CUDA kernel generation.

* add test

* lint fix

* fix
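
The compile-time folding of a constant broadcast mentioned in the first bullet above can be pictured as replicating one narrow integer across every lane of a packed word. The following is a minimal Python sketch of that arithmetic only. The function name `broadcast_constant` is hypothetical, and the sketch does not reproduce the actual CUDA code generator or its output.

```python
def broadcast_constant(value, bits, lanes):
    """Replicate the low `bits` bits of `value` across `lanes` lanes.

    Returns a single non-negative integer holding all lanes, with lane 0
    in the least significant bits.
    """
    mask = (1 << bits) - 1
    lane = value & mask  # two's-complement truncation for negative inputs
    packed = 0
    for i in range(lanes):
        packed |= lane << (i * bits)
    return packed


print(hex(broadcast_constant(0x7F, 8, 4)))  # 0x7f7f7f7f
print(hex(broadcast_constant(-1, 8, 4)))    # 0xffffffff
print(hex(broadcast_constant(0x3, 4, 8)))   # 0x33333333
```

When the broadcast value is known at compile time, a packed literal like these can be emitted directly; a symbolic value needs the equivalent replication done at runtime.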

0.1.7.post2

[Release] Bump version to 0.1.7.post2 (#1575)

v0.1.7.post1

[Release] Bump version into 0.1.7.post1 (#1506)

v0.1.7

[Typing] Enhance compatibility for advanced typing features in Python (#1382)

- Updated `allocate.py` and `annot.py` to improve compatibility with Python 3.9 and later by conditionally importing advanced typing features such as `TypeVarTuple`, `Unpack`, and `ParamSpec`.
- Added fallback imports from `typing_extensions` for environments using earlier Python versions.
- Improved handling of generic alias detection to ensure consistent behavior across different Python versions.
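
The conditional-import pattern described above might look roughly like the following. This is a sketch under stated assumptions, not the contents of `allocate.py` or `annot.py`: the helper `is_generic_alias` is hypothetical, and the fallback branch assumes the `typing_extensions` package is installed on older interpreters.

```python
import sys
from typing import List, get_origin

# ParamSpec joined the standard `typing` module in Python 3.10;
# TypeVarTuple and Unpack joined it in Python 3.11.
if sys.version_info >= (3, 11):
    from typing import ParamSpec, TypeVarTuple, Unpack
else:
    # Assumption: typing_extensions is available as a backport.
    from typing_extensions import ParamSpec, TypeVarTuple, Unpack

P = ParamSpec("P")
Ts = TypeVarTuple("Ts")


def is_generic_alias(tp):
    """Hypothetical helper: True for parameterized generics like List[int]."""
    return get_origin(tp) is not None


print(is_generic_alias(List[int]))  # True
print(is_generic_alias(int))        # False
```

Using `typing.get_origin` avoids comparing against interpreter-specific alias classes, which is one way to get consistent detection across Python versions.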

v0.1.6.post2

[Release] Bump version to v0.1.6.post2 (#1160)

* [Release] Update README and VERSION for v0.1.6.post2 compatibility with Python 3.8

* [Enhancement] Update packaging configuration and Docker scripts for multi-architecture support

  * Add allowlist for TVM, CUTLASS, and Composable Kernel items in pyproject.toml
  * Enhance docker_local_distribute.sh to support cross-architecture builds using docker buildx
  * Modify pypi.manylinux.Dockerfile to accept TARGETARCH argument for better architecture handling

* [Enhancement] Improve Docker scripts and build process for multi-architecture support

  * Update .gitignore to include dist directories
  * Refactor docker_local_distribute.sh for better cross-architecture handling and error management
  * Enhance docker_pypi_distribute.sh to support multi-architecture builds with docker buildx
  * Modify pypi_distribution.sh to clean up additional directories
  * Update pypi.manylinux.Dockerfile for improved environment configuration and architecture handling

* fix

* Remove outdated classifier for Artificial Intelligence from pyproject.toml

* Update pyproject.toml classifiers and modify Docker distribution scripts for clarity

  * Add new classifier for Artificial Intelligence in pyproject.toml
  * Rename output directories in docker_local_distribute.sh and docker_pypi_distribute.sh for better context