-
Mattmann.AI
- La Canada Flintridge, CA
- http://mattmann.ai
- http://x.com/chrismattmann/
- http://instagram.com/chrismattmann/
Highlights
- Pro
Stars
Free and Open Source, Distributed, RESTful Search Engine
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running mat…
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Open source routing engine for OpenStreetMap. Use it as Java library or standalone web server.
A scalable, distributed Time Series Database.
A machine learning software for extracting information from scholarly documents
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
Apache Nutch is an extensible and scalable web crawler
Autopsy® is a digital forensics platform and graphical interface to The Sleuth Kit® and other digital forensics tools. It can be used by law enforcement, military, and corporate examiners to invest…
Anthelion is a plugin for Apache Nutch to crawl semantic annotations within HTML pages.
Deeplearning4j Examples (DL4J, DL4J Spark, DataVec)
Please visit https://github.com/h2oai/h2o-3 for latest H2O
DmitryKey / luke
Forked from sonarme/lukeThis is mavenised Luke: Lucene Toolbox Project
Elasticsearch File System Crawler (FS Crawler)
Java API for GeoIP2 webservice client and database reader
A programmable, embeddable web browser driver compatible with the Selenium WebDriver spec -- headless, WebKit-based, pure Java
Mapper Attachments Type plugin for Elasticsearch
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Neural Adaptive Machine Translation that adapts to context and learns from corrections.
Wicketstuff-core projects are bundled user contributions for use with Apache Wicket (https://wicket.apache.org/). They are released in step with Wicket releases to make them easy to use.
Source code for Big Data: Principles and best practices of scalable realtime data systems