The home of the ICU project source code.
-
Updated
Apr 17, 2026 - C++
The home of the ICU project source code.
Fast and customizable text tokenization library with BPE and SentencePiece support
Fast and Portable Character String Processing in R (with the Unicode ICU)
Detect character encoding using ICU
Harfbuzz with a CMake build configuration using Freetype2, UCDN and ICU
Mirror of svn project at http://source.icu-project.org/repos/icu/icu/. The FieldWorks branch has some FieldWorks specific enhancements.
Project in JUCE & Faust for sonification of the ST-segment in ECG data. Contain two proposed sonification methods: Vocal & Synthesized
A high-performance Node.js native addon providing seamless conversion from Windows time zone IDs to IANA time zone names using ICU’s official mapping.
📦 Optimize tokenization in C++ for HuggingFace models with a fast, production-ready library supporting BPE, WordPiece, and Unigram methods.
Primitive interactive biotope simulation with an infinite, procedurally generated map
A C++20 header-only library for building powerful, composable data transformation pipelines — from integer ↔ bytes, base encodings, hashing, compression, and encryption to Unicode conversions.
[OBSOLETE] The recipe is now in https://github.com/conan-io/conan-center-index
Add a description, image, and links to the icu topic page so that developers can more easily learn about it.
To associate your repository with the icu topic, visit your repo's landing page and select "manage topics."