heluocs

Starred repositories

The LLM Evaluation Framework

Python 14,321 1,308 Updated Mar 27, 2026

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 21,250 2,265 Updated Mar 11, 2025

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 3,229 1,292 Updated Mar 29, 2026

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 76,507 8,570 Updated Mar 29, 2026

Structured data extraction and instruction calling with ML, LLM and Vision LLM

Python 5,143 510 Updated Mar 29, 2026

PyGWalker: Turn your dataframe into an interactive UI for visual analysis

Python 15,702 858 Updated Mar 2, 2026

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,492 831 Updated Mar 23, 2026

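Xiaohongshu note and comment crawler, Douyin video and comment crawler, Kuaishou video and comment crawler, Bilibili video and comment crawler, Weibo post and comment crawler, Baidu Tieba post and comment-reply crawler, Zhihu Q&A article and comment crawler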

Python 46,855 10,076 Updated Mar 24, 2026

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 12,668 3,548 Updated Mar 29, 2026

Upserts, Deletes And Incremental Processing on Big Data.

Java 6,125 2,470 Updated Mar 29, 2026

Apache Iceberg

Java 8,674 3,107 Updated Mar 29, 2026

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,156 346 Updated Mar 27, 2026

Master programming by recreating your favorite technologies from scratch.

Markdown 484,275 45,565 Updated Feb 21, 2026

Large World Model -- Modeling Text and Video with Millions Context

Python 7,404 558 Updated Oct 19, 2024

Ant game engine

Lua 3,927 408 Updated Nov 17, 2025

Apache Flink

Java 25,906 13,905 Updated Mar 29, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,623 14,918 Updated Mar 29, 2026

Apache Spark - A unified analytics engine for large-scale data processing

Scala 43,057 29,141 Updated Mar 29, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,213 2,223 Updated Mar 29, 2026

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,207 396 Updated Jul 11, 2024

JavaScript Style Guide

JavaScript 148,122 26,748 Updated Feb 24, 2026

Apache Parquet Format

Thrift 2,317 475 Updated Mar 22, 2026

Fast and simple stream processing of files in tar files, useful for deep learning, big data, and many other applications.

Go 135 15 Updated Dec 10, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,262 4,003 Updated Jul 17, 2024

Python 1,521 174 Updated Nov 9, 2023

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,856 2,231 Updated Mar 27, 2026

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 39,062 4,686 Updated Aug 19, 2024