Stars
[COLM 2025] An Open Math Pre-trainng Dataset with 370B Tokens.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale