-
AI Frameworks Engineer @intel
- SH
-
19:41
(UTC +08:00) - https://yiliu30.github.io/
Lists (5)
Sort Name ascending (A-Z)
Stars
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.
CowAgent是基于大模型的超级AI助理,能主动思考和任务规划、访问操作系统和外部资源、创造和执行Skills、拥有长期记忆并不断成长,比OpenClaw更轻量和便捷。同时支持微信、飞书、钉钉、企微、QQ、公众号、网页等接入,可选择OpenAI/Claude/Gemini/DeepSeek/ Qwen/GLM/Kimi/LinkAI,能处理文本、语音、图片和文件,可快速搭建个人AI助理和企…
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Making large AI models cheaper, faster and more accessible
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…
Free ChatGPT&DeepSeek API Key,免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API,支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.
SGLang is a high-performance serving framework for large language models and multimodal models.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Fast and memory-efficient exact attention
Universal LLM Deployment Engine with ML Compilation
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Train transformer language models with reinforcement learning.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Machine Learning Engineering Open Book
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)