Highlights
- Pro
Lists (6)
Sort Name ascending (A-Z)
Stars
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
💫 Industrial-strength Natural Language Processing (NLP) in Python
AIHawk aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
Official inference framework for 1-bit LLMs
get things from one computer to another, safely
OCR, layout analysis, reading order, table recognition in 90+ languages
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
An orchestration platform for the development, production, and observation of data assets.
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
Automated Machine Learning with scikit-learn
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
This repo powers my blog experiment where ChatGPT manages a real-money micro-cap stock portfolio.
Robyn is a Super Fast Async Python Web Framework with a Rust runtime.
Tools for merging pretrained large language models.
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
A lightweight data processing framework built on DuckDB and 3FS.
A collection of important graph embedding, classification and representation learning papers with implementations.
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Visual analysis and diagnostic tools to facilitate machine learning model selection.