Starred repositories
Easy & Flexible Alerting With ElasticSearch
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Home of StarCoder: fine-tuning & inference!
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
The pkuseg toolkit for multi-domain Chinese word segmentation
Google AI 2018 BERT pytorch implementation
A lightweight LMM-based Document Parsing Model
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
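text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and works out of the box.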
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
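Fengshenbang-LM is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA research institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.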
Instant Stack Overflow results whenever an exception is thrown
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale models.
A live stream development of RL tuning for LLM agents
MS-Agent: Lightweight Framework for Empowering Agents with Autonomous Exploration in Complex Task Scenarios
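A free, open-source database for quantitative trading of China A-shares; focused on A-shares and on quantitative trading; open, clean, continuously maintained, and built as a labor of love (a pun on "AI"). Made for individual quantitative traders: defend the 3000-point level and cherish the opportunity at the bottom... [stock data, stock quote data, stock quantitative data, stock trading data, candlestick (K-line) quote data, stock concept data, stock data API, quote data API, quantitative trading data] [multi-source data fusion, dynamic proxy configuration, high data availability]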
FlameScope is a visualization tool for exploring different time ranges as Flame Graphs.
Curated list of awesome Cursor Rules .mdc files
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
PAC/Dnsmasq/Wingy file generator that works with gfwlist and supports custom rules.
MongoDB data stream pipeline tools by YouGov (adopted from MongoDB)
AutoChain: Build lightweight, extensible, and testable LLM Agents
GPTeam: An open-source multi-agent simulation
A series of code large language models developed by PKU-KCL
Agent techniques to augment your LLM and push it beyond its limits
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow