PixelRiver is a file upload and processing system. Its primary objective is to compress the images whose URLs are provided in an uploaded CSV file.
The service accepts the file, returns an uploadId, compresses the images listed in the CSV, and fires a webhook after completion. It also offers an API to check processing status.
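To illustrate the flow, here is a minimal client sketch in TypeScript. The endpoint paths, port, CSV columns, and response shapes below are assumptions for illustration only; see the API documentation for the real contract.

```ts
// Hypothetical client flow -- endpoint paths, port, and response fields are
// assumptions for illustration, not the confirmed API contract.
const csv = "product,imageUrl\nshoe,https://example.com/shoe.jpg"; // illustrative CSV

const form = new FormData();
form.append("file", new Blob([csv], { type: "text/csv" }), "products.csv");

// 1. Upload the CSV; the response carries an uploadId.
const uploadRes = await fetch("http://localhost:3000/upload", { method: "POST", body: form });
const { uploadId } = await uploadRes.json();

// 2. Poll the status-check API (the webhook fires separately on completion).
const statusRes = await fetch(`http://localhost:3000/status/${uploadId}`);
console.log(await statusRes.json()); // e.g. { "status": "processing" }
```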
Please refer to the detailed technical design docs here.
- Node.js: For the API service
- Python: For the worker service
- MongoDB: NoSQL, transaction-capable DB for storing upload metadata
- Kafka: Messaging queue for the worker service
- Redis: For caching, rate limiting, & Pub/Sub (for webhook firing)
- GCP: Storage bucket for file uploads
- Nginx: Load balancing & reverse proxy
The system consists of the following key components. Each component is explained separately.
- API Server (Node.js + TypeScript) - Handles file uploads, status checks, and API requests.
- Nginx (LB & Reverse Proxy) - Used as the load balancer for the API service.
- Rate Limiting - For the status-check API. Uses the fixed-window technique, implemented via Redis (see the sketch after this list).
- Storage Bucket (GCP) - Stores uploaded files.
- Database (MongoDB) - Persists metadata and processed results.
- Caching Layer (Redis) - Serves frequent status checks and avoids repeated DB calls. Also used for rate limiting.
- Message Queue (Kafka) - Ensures reliable communication between services.
- Image Processing Service (Python-based processors) - Processes images asynchronously.
- Webhook Dispatcher (Redis Pub/Sub) - Notifies the user when processing is complete.
- Logging & Monitoring (Winston) - Each service writes its logs to its log/ directory; these can later be used for monitoring.
For a detailed explanation of each component, please refer to the technical design docs.
- You can find the public API documentation here.
- This is a working documentation collection which you can import into Postman to test the various APIs, like this:
- The project also provides internal API documentation for developers. To access it, start your local development server.
- These are internal docs meant for developers only, so they are not served in production.
- Then you can access the documentation by going to the /api-docs-internal route. It will serve the docs and will look like this.
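One plausible way to keep these docs out of production is an environment check when mounting the route. This is a sketch assuming Express and statically served docs from the api/docs directory, not necessarily how the repo actually wires it:

```ts
import express from "express";
import path from "path";

const app = express();

// Serve the internal developer docs only outside production.
if (process.env.NODE_ENV !== "production") {
  app.use("/api-docs-internal", express.static(path.join(__dirname, "../docs")));
}
```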
The repo follows a monorepo structure. Here is an overview; please refer to the README file of each module for more info.
```
./
├── api                    # API service that provides the upload API
│   ├── docs/              # Internal developer API docs
│   ├── README.md
│   └── src                # Actual source code
│       ├── app.ts         # Main service file
│       ├── http           # All HTTP-related code
│       │   ├── controllers
│       │   ├── middlewares
│       │   ├── routes
│       │   └── server.ts  # Boots up the HTTP server
│       ├── infra          # Handles infra services like Redis, Kafka, etc.
│       ├── models         # DB models
│       └── services       # Actual business logic
├── image-processor        # Consumer service that processes the uploaded files
│   └── README.md
└── pixelriver             # Project documentation & methodology
    └── README.md
```
- Node.js
- NPM
- Python
- Kafka with ZooKeeper
- Redis
- GCP Storage Bucket
- MongoDB
- Nginx (not required for the local env)
```sh
git clone git@github.com:chinmayagrawal775/pixelriver.git
cd pixelriver
```
Ensure your Kafka broker is running; please refer to the Kafka Quickstart Guide. The following are a few useful commands:
```sh
# Format log directory
KAFKA_CLUSTER_ID="$(bin/kafka-storage.sh random-uuid)"
bin/kafka-storage.sh format --standalone -t $KAFKA_CLUSTER_ID -c config/server.properties

# Run kafka server
bin/kafka-server-start.sh config/server.properties

# Consume messages (for testing only)
bin/kafka-console-consumer.sh --topic pixelriver-new-upload --from-beginning --bootstrap-server localhost:9092
```
Then make sure to create this topic in your Kafka:
```sh
# Create kafka topic
bin/kafka-topics.sh --create --topic pixelriver-new-upload --bootstrap-server localhost:9092
```
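For context, the API service publishes a message to this topic for each new upload, which the console-consumer command above can be used to observe. The sketch below shows the general shape using kafkajs; the client library and message payload are assumptions for illustration, not the repo's actual producer code.

```ts
import { Kafka } from "kafkajs";

const kafka = new Kafka({ clientId: "pixelriver-api", brokers: ["localhost:9092"] });
const producer = kafka.producer();

// Publish a "new upload" event; the payload shape here is assumed.
await producer.connect();
await producer.send({
  topic: "pixelriver-new-upload",
  messages: [
    { key: "<uploadId>", value: JSON.stringify({ uploadId: "<uploadId>", file: "uploads/input.csv" }) },
  ],
});
```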
Note: You can also run the server without Kafka by adding `DISABLE_KAFKA="true"` to the `.env` file for a quick server launch.
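A flag like this is typically honored at infra bootstrap. A minimal sketch of the idea, assuming the check happens before the producer connects (the actual wiring under api/src/infra may differ):

```ts
// Sketch: skip Kafka entirely when DISABLE_KAFKA="true" is set in .env
export async function initKafka(): Promise<void> {
  if (process.env.DISABLE_KAFKA === "true") {
    console.warn("Kafka disabled via DISABLE_KAFKA; upload events will not be published");
    return;
  }
  // ...otherwise connect the producer here
}
```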
For GCP setup: if you do not have valid GCP creds, you can spin up this fake GCP server: https://github.com/fsouza/fake-gcs-server. It will work fine.
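If you go that route, the Node GCS client can be pointed at the fake server via its apiEndpoint option. A minimal sketch, assuming fake-gcs-server is running with `-scheme http` on its default port 4443; the bucket name and projectId are placeholders:

```ts
import { Storage } from "@google-cloud/storage";

// Point the client at the local fake server instead of real GCP.
const storage = new Storage({ apiEndpoint: "http://localhost:4443", projectId: "local-dev" });

// "pixelriver-uploads" is a placeholder bucket name.
await storage.bucket("pixelriver-uploads").upload("./sample.csv");
```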
```sh
cd api
```

Create a `.env` file from `.env.example`. Then:

```sh
npm install
npm run dev
```
```sh
cd image-processor
```

Create a `.env` file from `.env.example`. Then:

```sh
virtualenv venv
source ./venv/bin/activate
pip install -r requirements.txt

# Run the image worker
python -m upload.main

# Run the webhook worker
python -m webhook.main
```
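For orientation, the webhook worker's job is to listen on Redis Pub/Sub and POST a completion notification to the user's webhook URL. Here is a TypeScript sketch of that pattern using ioredis; the actual worker is Python, and the channel name and payload shape are assumptions:

```ts
import Redis from "ioredis";

const sub = new Redis();

// "pixelriver:webhooks" is an assumed channel name.
await sub.subscribe("pixelriver:webhooks");

sub.on("message", async (_channel, raw) => {
  const { webhookUrl, uploadId, status } = JSON.parse(raw); // assumed payload shape
  await fetch(webhookUrl, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ uploadId, status }),
  });
});
```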
In the future, the service can be extended by:
- Providing user authentication
- Saving the product data (which is in the CSV) in the DB
- Implementing long polling in the status-check API
- Implementing a DLQ for failed processing
- Providing internal analytics dashboards