Skip to content
View liujinf's full-sized avatar
  • chengdu

Block or report liujinf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

27 stars written in Python
Clear filter

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 153,998 31,478 Updated Dec 18, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 70,027 7,598 Updated Dec 18, 2025

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 69,832 8,401 Updated Sep 20, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,150 7,778 Updated Dec 16, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 60,813 7,504 Updated Oct 4, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 45,873 6,644 Updated Dec 17, 2025

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 36,815 6,089 Updated Nov 10, 2025

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python 28,080 4,537 Updated Dec 18, 2025

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 21,121 2,036 Updated Dec 18, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,297 2,132 Updated Dec 17, 2025

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 20,279 4,969 Updated Dec 18, 2025

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Python 18,608 2,446 Updated May 16, 2025

An orchestration platform for the development, production, and observation of data assets.

Python 14,615 1,910 Updated Dec 18, 2025

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 11,997 2,212 Updated Dec 17, 2025

Always know what to expect from your data.

Python 11,005 1,653 Updated Dec 18, 2025

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 8,590 894 Updated Dec 17, 2025

StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes rules engine, workflow, 16…

Python 6,390 779 Updated Dec 10, 2025

CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data…

Python 4,909 2,069 Updated Dec 16, 2025

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Python 4,711 972 Updated Dec 18, 2025

Compare tables within or across databases

Python 2,992 298 Updated May 17, 2024

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Python 2,791 372 Updated Oct 12, 2025

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Python 2,291 191 Updated Dec 18, 2025

Open-source IoT Gateway - integrates devices connected to legacy and third-party systems with ThingsBoard IoT Platform using Modbus, CAN bus, BACnet, BLE, OPC-UA, MQTT, ODBC and REST protocols

Python 2,042 953 Updated Dec 10, 2025

MetricFlow allows you to define, build, and maintain metrics in code.

Python 1,415 136 Updated Dec 18, 2025

Data Pipeline Framework using the singer.io spec

Python 657 131 Updated Dec 15, 2025
Python 71 18 Updated May 19, 2023

Open-source metadata collector based on ODD Specification

Python 44 14 Updated Nov 6, 2023