Stars
Automated GPU Kernel Generation via Co-Evolving Intrinsic World Model
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
Leveraging Base Language Models for Few-Shot Synthetic Data Generation
Achieve state of the art inference performance with modern accelerators on Kubernetes
Internet-scale OpenID Certified™ OpenID Connect and OAuth2.1 provider that integrates with your user management through headless APIs. Solve OIDC/OAuth2 user cases over night. Consume as a service …
Build production-ready AI agents in both Python and TypeScript.
These are all of the files for all my YouTube videos.
Data structures and algorithms in X minutes. Code examples from my YouTube channel.
The official repo for "LLoCo: Learning Long Contexts Offline"
OpenHands: AI-Driven Development
Devika is the first open-source implementation of an Agentic Software Engineer. Initially started as an open-source alternative to Devin.
A high-throughput and memory-efficient inference and serving engine for LLMs
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Python interface to the ecCodes GRIB/BUFR decoder/encoder
Python library for file name matching with glob patterns
Training and serving large-scale neural networks with auto parallelization.
🔥 Blazing fast bulk data transfers between any cloud 🔥
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).
Stocator is a high-performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.
Automatically generates configuration files for Lithops
Object Storage data processing for Ray framework
Enables Ray to use IBM Gen2 backend