Skip to content
View marcromeyn's full-sized avatar

Block or report marcromeyn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

727 stars written in Python
Clear filter

SQL databases in Python, designed for simplicity, compatibility, and robustness.

Python 17,081 783 Updated Nov 4, 2025

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

Python 16,957 765 Updated Nov 7, 2025

A Powerful Spider(Web Crawler) System in Python.

Python 16,956 3,684 Updated Apr 30, 2024

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 16,686 3,688 Updated Jun 2, 2023

A curated list of awesome commands, files, and workflows for Claude Code

Python 16,577 937 Updated Nov 7, 2025

Train transformer language models with reinforcement learning.

Python 16,214 2,279 Updated Nov 7, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,057 3,180 Updated Nov 7, 2025

A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)

Python 15,462 1,023 Updated Nov 6, 2025

PyGWalker: Turn your dataframe into an interactive UI for visual analysis

Python 15,376 844 Updated Nov 5, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,217 2,442 Updated Nov 7, 2025

🦉 Data Versioning and ML Experiments

Python 15,068 1,254 Updated Nov 4, 2025

FauxPilot - an open-source alternative to GitHub Copilot server

Python 14,760 634 Updated Apr 9, 2024

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,625 2,257 Updated Nov 2, 2025

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Python 14,290 1,836 Updated Jul 3, 2024

A formatter for Python files

Python 13,962 903 Updated Oct 17, 2025

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Python 13,238 1,754 Updated Nov 3, 2025

A hyperparameter optimization framework

Python 12,980 1,186 Updated Nov 5, 2025

Structured Outputs

Python 12,812 644 Updated Oct 27, 2025

Postgres CLI with autocompletion and syntax highlighting

Python 12,806 575 Updated Jul 31, 2025

the first library to let you embed a developer agent in your own app!

Python 12,175 1,098 Updated Apr 7, 2024

Python job scheduling for humans.

Python 12,174 982 Updated May 25, 2024

An open-source NLP research library, built on PyTorch.

Python 11,881 2,242 Updated Nov 22, 2022

Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).

Python 11,201 640 Updated Nov 4, 2025

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 11,072 874 Updated Oct 17, 2025

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…

Python 10,626 972 Updated Nov 6, 2025

This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.

Python 10,373 4,285 Updated Dec 22, 2020

A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.

Python 10,186 1,147 Updated May 30, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,070 733 Updated Oct 31, 2025

TensorFlow-based neural network library

Python 9,891 1,309 Updated Aug 4, 2025

The power of Claude Code / GeminiCLI / CodexCLI + [Gemini / OpenAI / OpenRouter / Azure / Grok / Ollama / Custom Model / All Of The Above] working as one.

Python 9,525 796 Updated Oct 22, 2025