Stars
PyTorch building blocks for the OLMo ecosystem
Toolkit for linearizing PDFs for LLM datasets/training
Modeling, training, eval, and inference code for OLMo
Data and tools for generating and inspecting OLMo pre-training data.
A high-throughput and memory-efficient inference and serving engine for LLMs
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
An exploration of a few AI authors and the data Semantic Scholar has about their citations.
Package githubv4 is a client library for accessing GitHub GraphQL API v4 (https://docs.github.com/en/graphql).
Data pipelines for cloud config and security data. Build cloud asset inventory, CSPM, FinOps, and vulnerability management solutions. Extract from AWS, Azure, GCP, and 70+ cloud and SaaS sources.
A use-package inspired plugin manager for Neovim. Uses native packages, supports Luarocks dependencies, written in Lua, allows for expressive config
An open-source NLP research library, built on PyTorch.
Backup and migrate Kubernetes applications and their persistent volumes
Easy web analytics. No tracking of personal data.
A terminal-based presentation tool with colors and effects.
Tsunami is a general purpose network security scanner with an extensible plugin system for detecting high severity vulnerabilities with high confidence.
The library for web and native user interfaces.
A simple SSL/TLS proxy with mutual authentication for securing non-TLS services.
A tool for visualizing trees, tailored specifically to the analysis of parse trees.
Protocol Buffers for JavaScript & TypeScript.
Sprest is a collection of libaries to make building REST services simpler using Spray.
📄 Documented Style Sheets Parser