Stars
- All languages
- C
- C++
- CMake
- CSS
- CoffeeScript
- Common Workflow Language
- Cuda
- Cython
- Dockerfile
- Fortran
- Go
- HTML
- Haskell
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Lua
- MATLAB
- Nunjucks
- OCaml
- Objective-C
- OpenEdge ABL
- PHP
- Perl
- Protocol Buffer
- PureBasic
- Python
- Ruby
- Rust
- Scala
- Shell
- Swift
- TeX
- TypeScript
- Vim Script
- XSLT
A collection of skills for AI financial analysis.
Train transformer language models with reinforcement learning.
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LMOngoing research training transformer language models at scale, including: BERT & GPT-2
Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Machine Learning Engineering Open Book
Official PyTorch implementation for "Large Language Diffusion Models"
AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each problem, and is faster than existing implementations.
Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Haxophone, an electronic musical instrument that resembles a saxophone
Training and serving large-scale neural networks with auto parallelization.
Shortcuts for Siri using ChatGPT API, supports continuous conversations, configure the API key & save chat records. 由 ChatGPT API 模型驱动的智能 Siri,支持连续对话,配置API key,配置系统prompt,保存聊天记录。
Unsupervised text tokenizer for Neural Network-based text generation.
The official code of WWW2021 paper: Extract the Knowledge of Graph Neural Networks and Go Beyond it: An Effective Knowledge Distillation Framework
C++ `std::unique_ptr` that represents each object as an NFT on the Ethereum blockchain
Breeze is/was a numerical processing library for Scala.
IEEE TNNLS 2021, transformer, multi-graph transformer, graph, graph classification, sketch recognition, sketch classification, free-hand sketch, official code of the paper "Multi-Graph Transformer …
✨ Hacker News, but refined — Interface tweaks and features to make the HN experience better