- Jane Street
- New York
- https://fanpu.io
- @FanPu_Zeng
Stars
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
The nnsight package enables interpreting and manipulating the internals of deep learned models.
Tooling for exact and MinHash deduplication of large-scale text datasets
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets
All-pair set similarity search on millions of sets in Python and on a laptop
Python extension for MurmurHash (MurmurHash3), a set of fast and robust hash functions.
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
A specification that python filesystems should adhere to.
Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Ship correct and fast LLM kernels to PyTorch
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
Streaming WARC/ARC library for fast web archive IO
[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule
GitHub Pages template based on HTML and Markdown for personal, portfolio-based websites.