Starred repositories
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Janus-Series: Unified Multimodal Understanding and Generation Models
An orchestration platform for the development, production, and observation of data assets.
Gel supercharges Postgres with a modern data model, graph queries, Auth & AI solutions, and much more.
GenAI Agent Framework, the Pydantic way
Simple, unified interface to multiple Generative AI providers
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to this project.
Pocket Flow: Codebase to Tutorial
FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generator using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a mu…
🧙 Build, run, and manage data pipelines for integrating and transforming data.
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…
This is my personal template collection. Here you'll find templates and configurations for various tools and technologies.
AWS MCP Servers — helping you get the most out of AWS, wherever you use MCP.
Magic to turn Cursor/Windsurf into 90% of Devin
Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…
LLM training code for Databricks foundation models
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (…
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Open, Multi-modal Catalog for Data & AI
A collection of production-ready Generative AI Agent templates built for Google Cloud. It accelerates development by providing a holistic, production-ready solution, addressing common challenges (D…
Google Gen AI Python SDK provides an interface for developers to integrate Google's generative models into their Python applications.
Building a Secure and Interoperable Future for AI-Driven Payments.
Data Engineering Practice Problems
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.