Smol Snake

Side-loading Python deps into AWS lambdas.

AWS Lambda has limits on the size of a function deployment package, specifically, 50MB zipped and 250MB unzipped. This makes life difficult for Python projects with many or large dependencies, especially in the data engineering and ML space. This is because normally all dependencies are bundled and zipped into the deployment package. Common workarounds for this involve use of layers or container images. Both increase deployment complexity considerably.

Smol Snake takes a different approach. It works by pre-installing the dependencies into an EFS file system and then mounting that file system into Lambdas.

The implementation here works roughtly like this:

smolsnake lock --function-source-path=<path> generates a formal dependency lock file (smolsnake uses poetry and its excellent dependency solver under the hood).
The lock file is transmitted to a server that has write access to an EFS file system. The server runs smolsnake install --lockfile=<lockfile> to install all requested Python packages individually into the EFS mount using the following directory structure:
```
/efs/
  <python-version-1>/
    <dep-1>/
       <ver-1.0>/
          lib/<module>.py
       <ver-2.0>/
          lib/<module>.py
  <python-version-2>/
    <dep-3>/
      ...

  ...
```
Communication with the dependency cache server happens via SQS.
The lambda function source is amended to inject paths to required packages on the EFS mount with smolsnake injectsyspath --lockfile=<lockfile> into sys.path (runtime version of PYTHONPATH).

How to run this

Install the prerequisites: terraform, awscli, python3, jq.
Install smolsnake into a virtual environment with your favorite Python package installation method.
Get AWS access.

You'll need access to an AWS account with sufficient privileges to create SQS queues, IAM roles and policies, lambda functions and EC2 instances (admin access on a dedicated account is recommended). smolsnake requires no explicit credential configuration and expects default credentials to be available (either in the environment or in ~/.aws/credentials).

Create the EFS cache server:

terraform -chdir=terraform/depcache init
terraform -chdir=terraform/depcache apply

Create and run demo Lambda function:

terraform -chdir=tests/func1 init
terraform -chdir=tests/func1 apply

Modify func1/src or make a copy to experiment with your dependency-heavy Python function.

Development

If you need to debug the depcache server, enable public SSH by placing your public SSH key in terraform/depcache/ssh_authorized_keys and rerunning terraform apply:

terraform -chdir=terraform/depcache apply

You can then SSH to the depcache server (the IP address is in the Terraform outputs):

ssh -l ec2-user <decache-server-ip>

License

Distributed under the Zero Clause BSD (0BSD) license.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
src/smolsnake		src/smolsnake
terraform		terraform
tests/func1		tests/func1
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Smol Snake

How to run this

Development

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Smol Snake

How to run this

Development

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages