Lists (18)
Sort Name ascending (A-Z)
Stars
- All languages
- Ada
- Adblock Filter List
- Agda
- AngelScript
- Antlers
- AppleScript
- Assembly
- Astro
- AutoHotkey
- AutoIt
- Awk
- Batchfile
- Bikeshed
- Blade
- C
- C#
- C++
- C3
- CMake
- CSS
- Clojure
- CoffeeScript
- Common Lisp
- Coq
- Crystal
- Cuda
- D
- D2
- Dart
- Dockerfile
- Elixir
- Elm
- Emacs Lisp
- F#
- Factor
- Fennel
- Forth
- Futhark
- GDScript
- GLSL
- GSC
- Game Maker Language
- Gleam
- Go
- HCL
- HLSL
- HTML
- Haml
- Handlebars
- Haskell
- Haxe
- Isabelle
- Jai
- Janet
- Java
- JavaScript
- Jinja
- Jsonnet
- Julia
- Jupyter Notebook
- Just
- KakouneScript
- Koka
- Kotlin
- LLVM
- Lean
- Less
- Logos
- Lua
- Luau
- MDX
- Makefile
- Markdown
- Mathematica
- Mercury
- Meson
- Metal
- Mustache
- NASL
- NSIS
- Nim
- Nix
- Nunjucks
- Nushell
- OCaml
- Objective-C
- Objective-C++
- Odin
- PHP
- PLpgSQL
- POV-Ray SDL
- Pascal
- Perl
- Pony
- PostScript
- PowerShell
- Processing
- Prolog
- PureScript
- Python
- QML
- Racket
- ReScript
- RenderScript
- Rocq Prover
- Roff
- Ruby
- Rust
- SCSS
- SMT
- SVG
- Sage
- SaltStack
- Sass
- Scala
- Scheme
- ShaderLab
- Shell
- Spline Font Database
- Starlark
- Svelte
- Swift
- SystemVerilog
- TLA
- TSQL
- TeX
- Text
- TypeScript
- Typst
- V
- VBScript
- Vala
- Verilog
- Vim Script
- Visual Basic .NET
- Vue
- WGSL
- WebAssembly
- Wolfram Language
- XSLT
- YAML
- YARA
- Yacc
- Zig
- jq
- sed
6
results
for source starred repositories
written in Cuda
Clear filter
Instant neural graphics primitives: lightning fast NeRF and more
A massively parallel, optimal functional runtime in Rust
Flash Attention in ~100 lines of CUDA (forward pass only)
State of the art sorting and segmented sorting, including OneSweep. Implemented in CUDA, D3D12, and Unity style compute shaders. Theoretically portable to all wave/warp/subgroup sizes.