Skip to content
@Burla-Cloud

Burla

The simplest way to scale Python.



Scale Python to 1,000 VMs in your cloud in 1 second.

Burla is a open-source compute platform for scaling Python applications.
Run AI-inference, vector embeddings, ML-pipelines and more on thousands of VM's at once.

Burla only has one function:

from burla import remote_parallel_map

my_inputs = list(range(1000))

def my_function(x):
    print(f"[#{x}] running on separate computer")

remote_parallel_map(my_function, my_inputs)

This example runs my_function on 1,000 VMs in less than one second:

Burla terminal demo showing remote_parallel_map running on 1,000 computers


Scalable & efficient pipelines are not straightforward.

Slow deployments, VM reboots, or container rebuilds mean waiting 5-10 minutes with every change.
Errors are vague, and configs are full of secret tradeoffs. 90% resource utilization is a pipe dream.

Cryptic errors from Airflow, Ray, Dask, and AWS Batch: Broken DAG, OutOfMemoryError, KilledWorker, and INSUFFICIENT_CAPACITY


Burla can scale any workload with a single function.

Easily fan Python in/out across thousands of machines with varying sizes, types, and environments.
Quickly develop pipelines that handle 100+ TB datasets, using simple code anyone can understand.

This code:

remote_parallel_map(process, [...], image="rocker/geospatial:latest")
remote_parallel_map(aggregate, [...], func_cpu=64)
remote_parallel_map(predict, [...], func_gpu="A100")

Creates a pipeline like:

Burla data pipeline: cloud storage to CPUs, into a 64-CPU aggregation step, out to GPUs, and back to cloud storage


With Burla, the same jobs use 50% less compute.

Compared to software like Ray, Dask, or AWS Batch workloads running on Burla require less compute and automatically stay at 90%+ CPU/RAM utilization without taking any longer to finish the job.

This is achieved with adaptive concurrency and horizontal autoscaling. Burla quickly reacts to changes in task resource utilization, and rearranges work during runtime to fill excess capacity.

CPU utilization comparison: other orchestration tools fluctuate between idle and busy, while the same workload on Burla stays near full utilization

This system frequently more than doubles compute efficiency, and eliminates out of memory errors.
Read our blog to learn how it works.


How it works:

Running code in the cloud shouldn't feel any different from running code locally.

return_values = remote_parallel_map(my_function, my_inputs)

When a Python function is run using remote_parallel_map, it runs in the cloud but:

  • Anything it prints appears locally (and inside the dashboard).
  • Any exceptions are thrown locally.
  • Any packages or local modules are (very quickly) cloned on all remote machines.
  • Code starts running in under one second! Even with millions of inputs, or thousands of machines.

Burla automatically manages it's own pool of VMs underneath to maximize speed and efficiency.
You can manually add & remove machines from the pool, or let the platform react live to requests.


Everything you need to manage Python at scale.

Monitor your analysis, pipeline, or background job from your phone.
Burla has all the features you need to closely manage logs, output files, and available compute.

Burla dashboard showing live logs, output files, and cluster status


Examples:

Query 2.4TB of Parquet in 76s
Query 2.4TB of Parquet in 76s
Rank 1.7M Airbnbs
Rank 1.7M Airbnbs
Distill 572M Amazon reviews
Distill 572M Amazon reviews
Cluster 2.7M arXiv abstracts
Cluster 2.7M arXiv abstracts
Genomic alignment pipeline
Genomic alignment pipeline
Map 9.5M geotagged photos
Map 9.5M geotagged photos

Browse all →


Want to learn more? Schedule a call we're happy to answer any questions.

Popular repositories Loading

  1. burla burla Public

    The simplest way to scale Python.

    Python 254 11

  2. examples examples Public

    Burla examples, demos, and use cases

    HTML 5 1

  3. airbnb-burla airbnb-burla Public

    Every public Airbnb in Inside Airbnb's open dump (119 cities, 4 quarterly snapshots, 1.7M photos, 50.7M reviews). CLIP + Claude Haiku Vision + bootstrap CIs, all parallelized on Burla.

    Python 4

  4. burla-agent-starter-kit burla-agent-starter-kit Public

    Minimum-intervention onboarding + parallel job runner for Burla. One command, one password, fully automated.

    Python 3

  5. dbt-burla dbt-burla Public

    A dbt adapter that runs Python models on a Burla cluster (instead of Spark/Snowpark/Dataproc).

    Astro 2

  6. .github .github Public

Repositories

Showing 10 of 11 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…