TL;DR: Tracks all data and metrics during your pipeline run and visualizes them in an easily traceable way.
StepsTrack is an observability tool built to help track, visualize and inspect the intermediate steps of complex pipeline-based applications. It automatically captures and stores the intermediate data, results and execution time of each step in a pipeline, visualizes the execution details, and allows easier debugging and analysis through an analytics dashboard. It was originally developed as a go-to tool to inspect the runtime data of an agentic RAG pipeline.
It now supports both Python and Typescript / Node.js
Background of StepsTrack
StepsTrack is a lightweight inspection and debugging tool originally built to monitor an agentic Retrieval-Augmented Generation (RAG) pipeline running in a production environment—where visibility, performance, and stability are critical.
When chaining multiple LLM agents with custom logic and dynamic inputs, the non-deterministic nature of each step's LLM output often leads to dynamic logic routes and behaviors. I needed an inspection tool, but the existing tools didn't provide the granularity I needed to trace what happened inside each step of the pipeline.
So I built StepsTrack to do just that: trace, inspect, and understand every step of each request. It helped me quickly spot bottlenecks, unexpected behaviors and performance drags, and address them effectively.
I'm open-sourcing it in the hope that it helps others building and operating complex LLM pipelines.
Contributions welcome!
⭐🌟 Sounds interesting? Kindly give it a Star, it means a lot! ⭐🌟
- Tracking: Define steps in your pipeline to track intermediate data, results and execution time
- Visualizing: Export step details and generate basic visualizations, including Gantt charts and execution graphs
- Event Emitting: Listen to step events for real-time monitoring and custom handling
- ES6 Decorators: Easy integration with ES6 decorators
- LLM Tracking Extension: Simple tracker optimized for LLM usage
Monitor and analyze pipeline executions through an interactive web interface
- Detailed Step Data and Result Inspection
- Real-time Execution Monitoring
- Gantt Chart Visualization for pipeline runs
- Step Execution Stats
Note: StepsTrack is designed for any pipeline-based / multi-step logic, especially agentic LLM pipelines
This repository is a monorepo containing the following packages:
- Typescript / Python libraries that provide the basic tracker and chart generation functions for your pipeline
- A dashboard that visualizes tracked data and allows you to monitor it for analysis.
# Typescript
npm install --save steps-track
# Python
pip install steps-track
Create a pipeline and track steps with nested, sequential, or parallel logic:
Typescript
import { Pipeline, Step } from 'steps-track';
const pipeline = new Pipeline('my_pipeline');
await pipeline.track(async (st: Step) => {
// Track a simple step
await st.step('some_step', async (st: Step) => {
// ... your logic ...
st.record('key', 'value'); // Record data for analysis
return 'step_result' // Results are automatically recorded
});
// Track nested steps
await st.step('parent', async (st: Step) => {
await st.step('child_1', async (st: Step) => { /* ... */ });
await st.step('child_2', async (st: Step) => { /* ... */ });
});
// Track parallel steps
await Promise.all([
st.step('parallel_1', async (st: Step) => { /* ... */ }),
st.step('parallel_2', async (st: Step) => { /* ... */ })
]);
});
Python
import asyncio

from steps_track import Pipeline, Step

# Create a pipeline
pipeline = Pipeline('my-pipeline')

# Define your pipeline logic
async def pipeline_logic(st: Step):

    async def some_task(st: Step):
        # ... your logic ...
        st.record('key', 'value')  # Record data for analysis
        return 'some_result'       # Results are automatically recorded

    # Track a simple step
    result = await st.step('some_step', some_task)

    # Track nested steps
    async def parent(st: Step):
        await st.step('child_1', some_task)
        await st.step('child_2', some_task)

    await st.step('parent', parent)

    # Track parallel steps
    await asyncio.gather(
        st.step('parallel_1', some_task),
        st.step('parallel_2', some_task),
    )

# Run the pipeline
await pipeline.track(pipeline_logic)
You can also track steps with decorators (ES6 decorators in Typescript, with_step in Python) for cleaner integration:
Typescript
import { Pipeline, Step, WithStep } from 'steps-track';
class PipelineController {
private pipeline: Pipeline;
constructor() {
this.pipeline = new Pipeline('my_pipeline');
}
/**
* Method to run the pipeline
*/
public async run() {
await this.pipeline.track(async (st: Step) => {
await this.someStep('some_value', st);
await this.parentFunc(st);
});
}
@WithStep('some_step')
async someStep(str: string, st: Step) {
// ... some logic ...
st.record('key', 'value'); // Record data for analysis
return 'step_result' // Results are automatically recorded
}
@WithStep('child')
async childFunc(param: number, st: Step) {
// ... some logic ...
}
@WithStep('parent')
async parentFunc(st: Step) {
// Track nested steps
await this.childFunc(1, st);
await this.childFunc(2, st);
// Track parallel steps
await Promise.all([
this.childFunc(3, st),
this.childFunc(4, st),
]);
}
}
const controller = new PipelineController();
await controller.run();
Python
import asyncio

from steps_track import Pipeline, Step, with_step


class PipelineController:

    def __init__(self):
        self.pipeline = Pipeline('my_pipeline')

    async def run(self):
        """Method to run the pipeline"""
        await self.pipeline.track(self._run_steps)

    async def _run_steps(self, st: Step):
        """Actual pipeline implementation method"""
        await self.some_step('some_value', st)
        await self.parent_func(st)

    @with_step('some_step')
    async def some_step(self, input_str: str, st: Step):
        # ... some logic ...
        await st.record('key', 'value')  # Record data for analysis
        return 'some_result'  # Results are automatically recorded

    @with_step('child')
    async def child_func(self, param: int, st: Step):
        # ... some logic ...
        pass

    @with_step('parent')
    async def parent_func(self, st: Step):
        # Track nested steps
        await self.child_func(1, st)
        await self.child_func(2, st)

        # Track parallel steps
        await asyncio.gather(
            self.child_func(3, st),
            self.child_func(4, st),
        )


controller = PipelineController()
await controller.run()
Generate visual outputs to understand and analyze execution flow:
Typescript
// Generate a Gantt chart Buffer using quickchart.io
const ganttChartBuffer = await pipeline.ganttQuickchart();
// Generate a Gantt chart HTML file with Google Charts
const ganttChartHtml = await pipeline.ganttGoogleChartHtml();
// Generate an execution graph URL
const executionGraphUrl = pipeline.executionGraphQuickchart();
// Get the hierarchical output of all steps
const stepsHierarchy = pipeline.outputNested();
Python
# Generate a Gantt chart Buffer using quickchart.io
gantt_chart_buffer = await pipeline.gantt_quickchart()
# Generate a Gantt chart HTML file with Google Charts
gantt_chart_html = await pipeline.gantt_google_chart_html()
# Generate an execution graph URL
execution_graph_url = pipeline.execution_graph_quickchart()
# Get the hierarchical output of all steps
steps_hierarchy = pipeline.output_nested()
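As a usage sketch (not part of the library API), the generated outputs can be written to disk for quick inspection. Continuing from the Typescript snippet above, this assumes the buffer returned by ganttQuickchart() is an encoded image and that ganttGoogleChartHtml() returns an HTML string:
Typescript
import { writeFileSync } from 'fs';

// Save the Gantt chart buffer as an image file (assumed to be PNG-encoded).
const ganttChartBuffer = await pipeline.ganttQuickchart();
writeFileSync('gantt.png', ganttChartBuffer);

// Save the Google Charts Gantt output as an HTML file.
const ganttChartHtml = await pipeline.ganttGoogleChartHtml();
writeFileSync('gantt.html', ganttChartHtml);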
Sample Gantt Chart
Sample Execution Graph
Sample Hierarchy Output
json
{
"name": "document-parse",
"key": "document-parse",
"time": { "startTs": 1739357985509, "endTs": 1739357990192, "timeUsageMs": 4683 },
"records": {},
"substeps": [
{
"name": "preprocess",
"key": "document-pipeline.preprocess",
"time": { "startTs": 1739357985711, "endTs": 1739357986713, "timeUsageMs": 1002 },
"records": {
"pageCount": 3
},
"result": [ "page_1_content", "page_2_content"],
"substeps": []
},
{
"name": "parsing",
"key": "document-pipeline.parsing",
"time": { "startTs": 1739357985711, "endTs": 1739357990192, "timeUsageMs": 4481 },
"records": {},
"substeps": [
{
"name": "page_1",
"key": "document-pipeline.parsing.page_1",
"time": { "startTs": 1739357987214, "endTs": 1739357990192, "timeUsageMs": 2978 },
"records": {},
"result": "page_1_content",
"substeps": []
},
{
"name": "page_2",
"key": "document-pipeline.parsing.page_2",
"time": { "startTs": 1739357987214, "endTs": 1739357989728, "timeUsageMs": 2514 },
"records": {},
"result": "page_2_content",
"substeps": []
}
]
},
{
"name": "sample-error",
"key": "document-pipeline.sample-error",
"time": { "startTs": 1739357990192, "endTs": 1739357990192, "timeUsageMs": 0},
"records": {},
"error": "Sample Error",
"substeps": []
}
]
}
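Because the hierarchy output is plain JSON, it is easy to post-process. Below is a small illustrative Typescript sketch (not part of the library) that walks the structure returned by outputNested() to find the slowest leaf step; the StepNode interface and the assumption that outputNested() returns a single root node of this shape are for illustration only:
Typescript
// Illustrative subset of the nested output shape shown above.
interface StepNode {
  name: string;
  key: string;
  time: { startTs: number; endTs: number; timeUsageMs: number };
  substeps: StepNode[];
}

// Recursively find the leaf step with the largest timeUsageMs.
function slowestLeaf(node: StepNode): StepNode {
  if (node.substeps.length === 0) {
    return node;
  }
  return node.substeps
    .map(slowestLeaf)
    .reduce((a, b) => (a.time.timeUsageMs >= b.time.timeUsageMs ? a : b));
}

// Continuing from the earlier snippets; the cast is illustrative only,
// since the declared return type of outputNested() may differ.
const hierarchy = pipeline.outputNested() as unknown as StepNode;
const slowest = slowestLeaf(hierarchy);
console.log(`Slowest step: ${slowest.key} (${slowest.time.timeUsageMs} ms)`);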
StepsTrack also provides event-emitting listeners, ES6 / Python decorators and an LLM Tracking Extension for easier integration. For more detailed usage, check out the Basic Usage and Advanced Usage guides.
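As an illustration of the event listeners, here is a minimal hedged sketch. The event names and listener signatures below are assumptions made for illustration only; check the Basic Usage guide for the actual event API.
Typescript
import { Pipeline, Step } from 'steps-track';

const pipeline = new Pipeline('my_pipeline');

// Assumed event names ('step-start', 'step-success') for illustration only;
// refer to the Basic Usage guide for the real names and payloads.
pipeline.on('step-start', (stepKey: string) => {
  console.log(`step started: ${stepKey}`);
});
pipeline.on('step-success', (stepKey: string, result: unknown) => {
  console.log(`step finished: ${stepKey}`, result);
});

await pipeline.track(async (st: Step) => {
  await st.step('some_step', async () => 'step_result');
});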
StepsTrack includes a dashboard that provides several features for monitoring and analyzing pipeline executions.
During pipeline initialization, define a Transport to relay pipeline run data to the dashboard. Currently, HttpTransport is supported. See Advanced Usage for more details.
Typescript
import { Pipeline, HttpTransport } from 'steps-track';

const httpTransport = new HttpTransport({
baseUrl: 'http://localhost:3000',
batchLogs: true,
});
// Create pipeline with HTTP transport
const pipeline = new Pipeline('my-pipeline', {
autoSave: 'real_time',
transport: httpTransport
});
// Run your pipeline
await pipeline.track(async (st) => {
// Your pipeline steps here
});
// Make sure to flush any pending logs when your application is shutting down
await httpTransport.flushAndStop();
Python
from steps_track import Pipeline, Step, HttpTransport, HttpTransportOptions
http_transport = HttpTransport(HttpTransportOptions(
base_url='http://localhost:3000',
batch_logs=True
))
# Create pipeline with HTTP transport
pipeline = Pipeline('my-pipeline',
auto_save='real_time',
transport=http_transport
)
# Run your pipeline
async def pipeline_logic(st):
# Your pipeline steps here
pass
await pipeline.track(pipeline_logic)
# Make sure to flush any pending logs when your application is shutting down
await http_transport.flush_and_stop()
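For long-running services, you may also want to flush pending logs on shutdown signals. A minimal sketch, continuing from the Typescript snippet above; the signal-handling wiring is an assumption and not part of the library:
Typescript
// Flush any buffered logs before the process exits on SIGTERM.
process.on('SIGTERM', async () => {
  await httpTransport.flushAndStop();
  process.exit(0);
});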
# Uses SQLite storage by default
docker run -p 3000:3000 lokwkin/steps-track-dashboard
See Dashboard for more details.
Details of a pipeline run. From here you can examine all the steps that ran in the pipeline, their auto-captured data and results, as well as time usage information.
The dashboard includes an auto-refresh option, allowing you to monitor pipeline runs in real time.
Gantt Chart for visualizing the time usage of each step in a pipeline run. You can see the real-time progress of the pipeline, with steps highlighted by status: running / success / failed.
Step Execution Stats, aggregated from past run histories with basic statistical information for performance analysis.
- Decorator support for easier integration.
- Generate speed analysis stats from multiple runs.
- Add Redis / File support for persistent data storage.
- Dashboard to monitor execution logs and results.
- Support real-time step execution monitoring.
- Convert into a monorepo and split the dashboard into an independent dockerized module
- Use GoogleChart / QuickChart instead of local chart.js generation
- Enhance StepsTrack Monitoring Dashboard UI/UX
- Allow importing external logs into dashboard
- Use SQLite as a more appropriate persistent storage for analytics
- Migrate data storage into the Dashboard module; use transports to relay logs.
- Optional LLM extension optimized for LLM response and usage tracking
- Data Retention Configuration
- Dashboard UX
- Filter to show selected step children
- Display custom step data as columns
- Fix auto-refresh being disabled when changing pages
- Use a memory store instead of storing nested step class instances at runtime
- Support Python version of steps tracker
MIT © lokwkin