Skip to content
View getcake's full-sized avatar
  • BOS
  • 13:07 (UTC -04:00)

Highlights

  • Pro

Organizations

@flask-extensions

Block or report getcake

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
7 stars written in Java
Clear filter

Apache Lucene open-source search software

Java 3,402 1,336 Updated Apr 17, 2026

Apache Nutch is an extensible and scalable web crawler

Java 3,150 1,264 Updated Apr 16, 2026

A search interface and wayback machine for the UKWA Solr based warc-indexer framework.

Java 137 28 Updated Apr 14, 2026

Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in this repo is now only for reference. For support and issues o…

Java 132 26 Updated Nov 21, 2025

Merged search-arctika and search-achon into a multi-module project

Java 14 2 Updated May 20, 2022
Java 5 1 Updated Mar 1, 2024

Java library to extract large scale data from a Solr server with index build by the Warc-indexer.

Java 2 Updated Jun 12, 2025