Starred repositories
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
🤗 smolagents: a barebones library for agents that think in code.
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☁️ Build multimodal AI applications with cloud-native stack
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Build resilient language agents as graphs.
Intelligent automation and multi-agent orchestration for Claude Code
Download market data from Yahoo! Finance's API
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
A configuration framework that enhances Claude Code with specialized commands, cognitive personas, and development methodologies.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
A lightweight, powerful framework for multi-agent workflows
A curated list of awesome commands, files, and workflows for Claude Code
pix2tex: Using a ViT to convert images of equations into LaTeX code.
verl: Volcano Engine Reinforcement Learning for LLMs
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme
Statsmodels: statistical modeling and econometrics in Python
A Library for Advanced Deep Time Series Models for General Time Series Analysis.
Fast and Accurate ML in 3 Lines of Code