- Cerebras Systems
- Bay Area, California
Stars
brucechanglongxu / smi-al
Forked from StanfordSIMILab/smi-al
Embedding-based clustering to reduce annotation for surgical segmentation.
brucechanglongxu / harmony
Forked from openai/harmony
Renderer for the harmony response format to be used with gpt-oss
brucechanglongxu / gpt-oss
Forked from openai/gpt-oss
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
brucechanglongxu / cutlass
Forked from NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
brucechanglongxu / vllm
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Fast and memory-efficient exact attention
brucechanglongxu / onnxruntime
Forked from microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
brucechanglongxu / TensorRT-LLM
Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
Cancer research and care have advanced dramatically over the past decades, yielding new therapies and improved patient outcomes. Yet despite these gains, cancer remains a leading global health chal…
CIRC: A protocol layer for coordinating clinical agents across systems, specialties, and institutions. A protocol layer for deploying, coordinating, and governing autonomous AI agents in healthcare…
This repository serves as a comprehensive resource on GFlowNets, exploring key algorithms, theoretical foundations, and practical implementations for generative flow networks. It is designed for re…
This repository explores Compressed Sensing, focusing on theory, algorithms, and practical implementations for signal reconstruction from sparse measurements. It is intended for researchers, engine…
Convert Machine Learning Code Between Frameworks