Starred repositories

The LLM Evaluation Framework

Python 14,736 1,352 Updated Apr 9, 2026

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 21,298 2,275 Updated Mar 11, 2025

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 3,236 1,298 Updated Apr 13, 2026

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 77,889 8,768 Updated Apr 13, 2026

Structured data extraction and instruction calling with ML, LLM and Vision LLM

Python 5,152 511 Updated Apr 12, 2026

PyGWalker: Turn your dataframe into an interactive UI for visual analysis

Python 15,721 863 Updated Apr 4, 2026

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,484 831 Updated Apr 11, 2026

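Xiaohongshu note | comment crawler, Douyin video | comment crawler, Kuaishou video | comment crawler, Bilibili video | comment crawler, Weibo post | comment crawler, Baidu Tieba post | Baidu Tieba comment-reply crawler | Zhihu Q&A article | comment crawler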

Python 47,768 10,242 Updated Apr 10, 2026

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 12,712 3,557 Updated Apr 13, 2026

Upserts, Deletes And Incremental Processing on Big Data.

Java 6,141 2,475 Updated Apr 12, 2026

Apache Iceberg

Java 8,724 3,152 Updated Apr 12, 2026

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,260 358 Updated Apr 13, 2026

Master programming by recreating your favorite technologies from scratch.

Markdown 489,676 46,229 Updated Feb 21, 2026

Large World Model -- Modeling Text and Video with Millions Context

Python 7,407 556 Updated Oct 19, 2024

Ant game engine

Lua 3,926 408 Updated Nov 17, 2025

Apache Flink

Java 25,941 13,915 Updated Apr 13, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 76,360 15,509 Updated Apr 13, 2026

Apache Spark - A unified analytics engine for large-scale data processing

Scala 43,112 29,159 Updated Apr 13, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,353 2,277 Updated Apr 13, 2026

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,210 398 Updated Jul 11, 2024

JavaScript Style Guide

JavaScript 148,105 26,731 Updated Feb 24, 2026

Apache Parquet Format

Thrift 2,342 479 Updated Apr 4, 2026

Fast and simple stream processing of files in tar files, useful for deep learning, big data, and many other applications.

Go 135 15 Updated Dec 10, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,254 4,001 Updated Jul 17, 2024

Python 1,517 174 Updated Nov 9, 2023

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,925 2,246 Updated Apr 10, 2026

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 39,074 4,686 Updated Aug 19, 2024