- San Francisco, California
- http://www.smerity.com
Stars
OpenRefine is a free, open-source power tool for working with messy data and improving it
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
Enso Analytics is a self-service data prep and analysis platform designed for data teams.
Apache Nutch is an extensible and scalable web crawler
Clear implementation of arithmetic coding for educational purposes in Java, Python, C++.
Huge Collections for Java using efficient off-heap storage
Java implementation of a probabilistic set data structure
A virtual pet that helps you raise your TDD practice.
An AWS SDK-backed FileSystem driver for Hadoop
Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.
Stipple Image and TSP-Art Generator for Processing
Aloisius / nutch
Forked from apache/nutch
CommonCrawl Test version of Nutch