GitHub - thraxil/hakmes: REST-based content-addressed storage for large files

Hakmes (it means "cleaver" in Dutch) is a REST-based content-addressed storage service intended for large files.

In a nutshell, you POST a file to it and it gives you back a hash (currently just a SHA1 of the contents of the file). Later, you can make a GET request with that hash as the key and it will return the contents of the file. It is optimized to work with large (multi-GB at least) files.

Hakmes functions as a front-end to a Cask cluster, which handles efficiently storing data replicated across multiple nodes. Hakmes takes the file that you upload, splits it into a number of chunks, and stores those chunks to the Cask cluster. When you retrieve the file, it pulls the chunks down and reassembles the file on the fly.

API

POST / -> upload file, returns json object with key, etc.
GET /file/<key>/ -> returns file, fetched by key.

A quick example with curl

$ echo "hello" > test.txt
$ curl -F file=@test.txt http://localhost:9300/
{"key":"sha1:f572d396fae9206628714fb2ce00f72e94f2258f",
 "extension":".txt","mimetype":"text/plain","size":6,
 "chunks":["sha1:f572d396fae9206628714fb2ce00f72e94f2258f"]}
$ curl -i http://localhost:9300/file/sha1:f572d396fae9206628714fb2ce00f72e94f2258f/
  HTTP/1.1 200 OK
  Content-Type: text/plain
  Etag: "sha1:f572d396fae9206628714fb2ce00f72e94f2258f"
  Date: Sun, 25 Jan 2015 13:43:10 GMT
  Transfer-Encoding: chunked
  
  hello

Config

Configuration is via environment variables. The following are used:

HAKMES_PORT

Port to listen on.

HAKMES_CASK_BASE

Base URL of a Cask node to use for the backend. Future versions might support comma separated lists of Cask nodes to allow Hakmes to round-robin or failover between them. In the meantime, for HA, it's recommended that you stick an HAProxy instance between Hakmes and Cask and point this setting there.

HAKMES_CHUNK_SIZE

Maximum size of chunks (in bytes). If a file is smaller than this, it will be uploaded to Cask as one piece. If it's larger, this is the size of chunks it's broken into. Cask works more smoothly with relatively small chunks, but the more chunks you have, the more overhead is involved. You will want to benchmark and try some different values out to figure out what works best for your Cask cluster and the typical file sizes that you are working with. A good starting point is probably in the 4-16MB range.

HAKMES_DB_PATH

Path to a boltdb file. If the file doesn't exist, Hakmes will create a new one when it starts.

HAKMES_SERIALIZE

If this is set to true, Hakmes will not start a server. Instead, it will iterate through the database and print out every entry as a line of JSON (JSONL format) to STDOUT. This is useful for backups or migrations.

HAKMES_INGEST

If this is set to true, Hakmes will not start a server. Instead, it will read lines of JSON (JSONL format) from STDIN and add those entries to the database if they are not already present. This is useful for restoring from a backup or migrating between database backends.

HAKMES_SSL_CERT, HAKMES_SSL_KEY

If you configure these, Hakmes will use SSL/TLS.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
.github		.github
.envrc		.envrc
.gitignore		.gitignore
Makefile		Makefile
README.markdown		README.markdown
dev.sh		dev.sh
flake.lock		flake.lock
flake.nix		flake.nix
go.mod		go.mod
go.sum		go.sum
hakmes.go		hakmes.go
hakmes_test.go		hakmes_test.go
key.go		key.go
key_test.go		key_test.go
reader.go		reader.go
reader_test.go		reader_test.go
site.go		site.go
site_test.go		site_test.go
store.go		store.go
views.go		views.go
views_test.go		views_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

API

Config

HAKMES_PORT

HAKMES_CASK_BASE

HAKMES_CHUNK_SIZE

HAKMES_DB_PATH

HAKMES_SERIALIZE

HAKMES_INGEST

HAKMES_SSL_CERT, HAKMES_SSL_KEY

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

API

Config

HAKMES_PORT

HAKMES_CASK_BASE

HAKMES_CHUNK_SIZE

HAKMES_DB_PATH

HAKMES_SERIALIZE

HAKMES_INGEST

HAKMES_SSL_CERT, HAKMES_SSL_KEY

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages