Skip to content
View akamnev's full-sized avatar

Block or report akamnev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 156,148 31,958 Updated Feb 4, 2026

Tesseract Open Source OCR Engine (main repository)

C++ 72,240 10,482 Updated Jan 8, 2026

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 70,163 9,784 Updated Feb 4, 2026

Ultralytics YOLO 🚀

Python 52,915 10,131 Updated Feb 4, 2026

A library for efficient similarity search and clustering of dense vectors.

C++ 38,988 4,216 Updated Feb 4, 2026

Google Research

Jupyter Notebook 37,215 8,320 Updated Feb 4, 2026

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 33,161 4,636 Updated Nov 27, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 28,901 3,534 Updated Dec 5, 2025

Library for fast text representation and classification.

HTML 26,481 4,820 Updated Mar 22, 2024

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 26,356 3,367 Updated Feb 4, 2026

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,592 5,861 Updated Aug 14, 2024

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,977 3,621 Updated Jul 28, 2024

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

C 22,198 7,940 Updated Dec 15, 2025

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Python 21,171 3,091 Updated Feb 4, 2026

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 21,017 4,915 Updated Jan 29, 2026

A fully-modern text-based browser, rendering to TTY and browsers

JavaScript 18,450 458 Updated Jul 11, 2025

Development repository for the Triton language and compiler

MLIR 18,348 2,550 Updated Feb 4, 2026

State-of-the-Art Text Embeddings

Python 18,206 2,754 Updated Feb 4, 2026

Datasets, Transforms and Models specific to Computer Vision

Python 17,493 7,205 Updated Feb 4, 2026

Mamba SSM architecture

Python 17,140 1,580 Updated Jan 12, 2026

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 16,974 3,718 Updated Jun 2, 2023

StableLM: Stability AI Language Models

Jupyter Notebook 15,767 1,023 Updated Apr 8, 2024

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 15,246 3,556 Updated Feb 4, 2026

newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:

HTML 14,966 2,133 Updated Dec 6, 2025

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,669 2,839 Updated Jan 23, 2026

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,664 2,256 Updated Dec 1, 2025

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,352 2,127 Updated Oct 27, 2025

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,344 987 Updated Feb 1, 2026

OpenProject is the leading open source project management software.

Ruby 14,321 3,077 Updated Feb 4, 2026

Open source code for AlphaFold 2.

Python 14,243 2,550 Updated Jan 15, 2026
Next