GitHub - bruin-data/ingestr: ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

Copy data from any source to any destination without any code

ingestr is a command-line app that allows you to ingest data from any source into any destination using simple command-line flags, no code necessary.

✨ copy data from your database into any destination
➕ incremental loading: append, merge or delete+insert
🐍 single-command installation

ingestr takes away the complexity of managing any backend or writing any code for ingesting data, simply run the command and watch the data land on its destination.

Installation

You can install ingestr using the install script:

curl -LsSf https://getbruin.com/install/ingestr | sh

Alternatively, you can install it with pip:

pip install ingestr

The pip package can also be used from Python. Install the SDK extra for Python data ingestion:

pip install 'ingestr[sdk]'

Python rows, generators, and DataFrames are sent to the bundled ingestr binary as Arrow IPC streams by default:

import ingestr

ingestr.ingest(
    [{"id": 1, "name": "Ada"}, {"id": 2, "name": "Grace"}],
    dest_uri="duckdb:///tmp/warehouse.duckdb",
    dest_table="main.people",
)

DataFrames and yielded data use the same Arrow stream transport:

ingestr.ingest(df, dest_uri="duckdb:///tmp/warehouse.duckdb", dest_table="main.events")

def events():
    yield [{"id": 1, "event": "signup"}]
    yield [{"id": 2, "event": "purchase"}]

ingestr.ingest(events, dest_uri="postgresql://...", dest_table="public.events")

For push-style code, omit the data argument and use ingest as a context manager. The context value accepts the same shapes as ingestr.ingest(data, ...):

with ingestr.ingest(dest_uri="postgresql://...", dest_table="public.events") as ingest:
    for response in client.list_events():
        ingest(response["items"])

For very large already-materialized data, use the existing mmap Arrow IPC file transport:

ingestr.ingest(df, dest_uri="duckdb:///tmp/warehouse.duckdb", dest_table="main.events", transport="mmap")

For full CLI pass-through, use ingestr.run(["ingest", "--source-uri", "...", "--dest-uri", "...", "--source-table", "..."]), or ingestr.run_cli(...) for keyword arguments that map to CLI flags.

Quickstart

ingestr ingest \
    --source-uri 'postgresql://admin:admin@localhost:8837/web?sslmode=disable' \
    --source-table 'public.some_data' \
    --dest-uri 'bigquery://<your-project-name>?credentials_path=/path/to/service/account.json' \
    --dest-table 'ingestr.some_data'

That's it.

This command:

gets the table public.some_data from the Postgres instance.
uploads this data to your BigQuery warehouse under the schema ingestr and table some_data.

Documentation

You can see the full documentation here.

Community

Join our Slack community here.

Contributing

Pull requests are welcome. However, please open an issue first to discuss what you would like to change. We maybe able to offer you help and feedback regarding any changes you would like to make.

Note

After cloning ingestr make sure to run make setup to install githooks.

Supported sources & destinations

	Source	Destination
Databases
AWS Athena	✅	✅
AWS Redshift	✅	✅
Cassandra	✅	✅
ClickHouse	✅	✅
Couchbase	✅	-
CrateDB	✅	✅
Databricks	✅	✅
DuckDB	✅	✅
DynamoDB	✅	✅
Elasticsearch	✅	✅
Google BigQuery	✅	✅
GCP Spanner	✅	-
IBM Db2	✅	-
InfluxDB	✅	-
Kafka	✅	-
Local CSV file	✅	✅
MaxCompute	✅	✅
Microsoft Fabric	✅	✅
Microsoft OneLake	-	✅
Microsoft SQL Server	✅	✅
MongoDB	✅	✅
MotherDuck	✅	✅
MySQL	✅	✅
Oracle	✅	-
Postgres	✅	✅
RabbitMQ	✅	-
SAP Hana	✅	-
Snowflake	✅	✅
Socrata	✅	-
SQLite	✅	✅
Synapse	-	✅
Trino	✅	✅
Platforms
Adjust	✅	-
Airtable	✅	-
Allium	✅	-
Amazon Kinesis	✅	-
Anthropic	✅	-
AppsFlyer	✅	-
Apple Ads	✅	-
Apple App Store	✅	-
Applovin	✅	-
Applovin Max	✅	-
Asana	✅	-
Attio	✅	-
Azure Data Lake Storage Gen2	✅	✅
Bruin	✅	-
Chess.com	✅	-
ClickUp	✅	-
Cursor	✅	-
Docebo	✅	-
Dune	✅	-
Facebook Ads	✅	-
Fireflies	✅	-
Fluxx	✅	-
Frankfurter	✅	-
Freshdesk	✅	-
FundraiseUp	✅	-
G2	✅	-
GitHub	✅	-
Google Ads	✅	-
Google Analytics	✅	-
Google Cloud Storage (GCS)	✅	✅
Google Sheets	✅	-
Gorgias	✅	-
Granola	✅	-
Hostaway	✅	-
HubSpot	✅	-
Indeed	✅	-
Intercom	✅	-
Internet Society Pulse	✅	-
Jira	✅	-
JobTread	✅	-
Klaviyo	✅	-
Linear	✅	-
LinkedIn Ads	✅	-
Mailchimp	✅	-
Mixpanel	✅	-
Monday	✅	-
Notion	✅	-
Paddle	✅	-
Personio	✅	-
PhantomBuster	✅	-
Pinterest	✅	-
Pipedrive	✅	-
Plus Vibe AI	✅	-
PostHog	✅	-
Primer	✅	-
QuickBooks	✅	-
Reddit Ads	✅	-
RevenueCat	✅	-
S3	✅	✅
Salesforce	✅	-
SFTP	✅	-
Shopify	✅	-
Slack	✅	-
Smartsheet	✅	-
Snapchat Ads	✅	-
Solidgate	✅	-
Stripe	✅	-
SurveyMonkey	✅	-
TikTok Ads	✅	-
Trustpilot	✅	-
Wise	✅	-
Zendesk	✅	-
Zoom	✅	-

Feel free to create an issue if you'd like to see support for another source or destination.

License

ingestr is source-available under the Functional Source License 1.1, with Apache 2.0 as the future license. You can use ingestr freely for internal production use, development, testing, education, research, and professional services. You cannot use ingestr to offer a competing commercial ingestion, ELT, connector, or managed data pipeline product/service.

Each version becomes Apache 2.0 two years after release.

Name		Name	Last commit message	Last commit date
Latest commit History 2,537 Commits
.githooks		.githooks
.github/workflows		.github/workflows
benchmarks		benchmarks
cmd		cmd
docs		docs
hack		hack
ingestr		ingestr
internal		internal
pkg		pkg
resources		resources
styles		styles
testdata		testdata
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitleaksignore		.gitleaksignore
.golangci.yml		.golangci.yml
.goreleaser.yaml		.goreleaser.yaml
.npmrc		.npmrc
.python-version		.python-version
.vale.ini		.vale.ini
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
THIRD_PARTY_LICENSES.txt		THIRD_PARTY_LICENSES.txt
depot.json		depot.json
gitleaks-baseline.json		gitleaks-baseline.json
go.mod		go.mod
go.sum		go.sum
install.sh		install.sh
main.go		main.go
package-lock.json		package-lock.json
package.json		package.json
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation

Quickstart

Documentation

Community

Contributing

Supported sources & destinations

License

About

Uh oh!

Releases 37

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Installation

Quickstart

Documentation

Community

Contributing

Supported sources & destinations

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 37

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages