-
-
-
db-deploy Public
Scripts and stuff to make Databricks deployments easier for MMC customers
2 UpdatedSep 30, 2024 -
-
db-repo-path Public
Demonstration of approach to access Python modules on workers in Git repo
-
-
pegasus Public
Forked from InsightDataScience/pegasusVM based deployment for prototyping Big Data tools on Amazon Web Services
Shell UpdatedMay 25, 2016 -
-
elasticsearch-dump Public
Forked from elasticsearch-dump/elasticsearch-dumpImport and export tools for elasticsearch
JavaScript Apache License 2.0 UpdatedJun 18, 2014 -
opennlp Public
Forked from apache/opennlpMirror of Apache OpenNLP (Incubating)
Java Apache License 2.0 UpdatedFeb 23, 2014 -
-
tnh Public
(T)he (N)ew (H)otness. Improved full-txt search of archival web data.
-
jbs Public
Builds Lucene/Solr indexes out of NutchWAX segments and revisit records via Hadoop.
-
-
-
ia-hadoop-tools Public
Clone of iof ia-hadoop-tools repo, but just zipnum branch with new features for zipnum and cluster merging.
-