-
Mattmann.AI
- La Canada Flintridge, CA
- http://mattmann.ai
- http://x.com/chrismattmann/
- http://instagram.com/chrismattmann/
Highlights
- Pro
Stars
Docker container to provide Apache Tika RESTful API
bash-httpd is a web server written in bash, the GNU bourne shell replacement.
Nutch with Cassandra and Elasticsearch on Docker
A dataset downloaded from the deep and scientific web across three major Polar data centers for use in research.
easy Mitie-nlp setup to use with MITIE NER enabled in TIKA
Some useful scripts I have written over time to help me with my day-to-day work
Collection of code and scripts to run Apache cTAKES against clinical text
Example of using Nutch to authenticate and crawl mrs.org
An Apache OODT, Apache Tika, and Apache Solr based system to automatically take large TSV file datasets, and to translate them from one language to another. Built and inspired by the DARPA XDATA Em…
An OpenShift Cartridge containing the Apache Tika JAXRS Server
package for generating analytic dashboards using open source tools
Bash script that creates a URL seed list with URLs included in a generic file
Bash script that performs file format identification on all files in a directory tree using Apache Tika
Information Retrieval for Planetary Science using DeepDive
Inspired from the instructions found at https://cwiki.apache.org/confluence/display/TIKA/GeoTopicParser.
A repository for Nutch crawl evaluation