Kashyap’s blog

Rewrites outside location blocks in Nginx are bad!

2024-06-05T00:00:00+05:30

When we did our recent performance tests on one of our nginx clusters I noticed something odd: the CPU was choking at a request rate that was too little for a system like that. It’s a static proxy server running vanilla nginx, and the downstream servers were doing okay in terms of latency. CPU on this system shouldn’t choke before saturating those downstream systems, but it did. perf reports on the process showed most of the samples occupied by symbols related to rewrite and ngx_http_regex_exec. While we have a lot of location rules in this codebase, and many of them are regular expression style matchers, it seemed like way too much time was occupied by these routines. What’s worse is that this happens even when we try running the benchmarks with known wrong URLs or triggering the block/rate-limiting configurations, which should bypass most of the location matching anyway. At one point I noticed Nginx (depending on compilation flags) has support for a PCRE JIT enigne configuration that promises improvement in regular expression matching. Turning this on did improve the situation quite a bit, but it wasn’t anything spectacular. The regex symbols still formed a large part in perf for every cut of URL type. Debugging this further pointed towards a combination of the following issues caused the problem:

We have lots of rewrite rules within a server block outside of location blocks.
The bypass routines in cases of known errors and/or rate limiting used relative paths in the config (internal redirects of sorts)

Our config had a bit of this shape:

http {
  server {
    server_name x.example.com localhost;
    listen 80;

    error_page  404 /404.html;
    error_page  403 /403.html;

    # hundreds of rewrite rules go here, intended to "normalize" matching URLs
    # rewrite {regex} ...
    # rewrite {regex} ...
    # rewrite {regex} ...
    # ... and so on

    location /404.html {
      root /srv/www/html;
    }

    location /403.html {
      root /srv/www/html;
    }

    location / {
      proxy_pass http://upstream;
    }
    # ... and so on
  }
}

In our case the configuration is spread over hundreds of files, and the rewrite rules shown here are all in a separate file which gets included right after the main server block configurations end and any location block definitions start.

A long time ago on the project a decision was made to add a file that contained a few URL matching rules that had to run in-order for every URL, before any location matching runs, sort of a pre-processor to “normalize” URLs.

This is okay as long as it doesn’t get abused. But slowly over time there were additions that should’ve simply been location blocks instead. For the uninitiated, location matching tends to be more efficient in matching URLs as nginx builds out a tree of these at startup rather than going at it serially one by one, which is what our problem technically became. This set of rules started growing as it became a kitchen-sink of sorts for every “wrong” URL—at the time of debugging the number of such rules were in the hundreds. These rules get executed multiple times if there are internal redirects; rewrite rules themselves can be internal redirects if they don’t use one of the bypassing flags like break, redirect, permanent, which further exacerbates the probem. This was true in our case since the intent was to run these in-order, which means last and break can’t be used by definition.

Secondly, we used relative paths for all the error_page configurations mostly as a carry-over from nginx configurations that are documented pretty much everywhere¹ . So when an error status is triggered nginx will redo the matching from the beginning. In isolation this is not a problem, and I can understand why the default documentation snippets use this pattern. In our case these two problems in combination create a cascading effect: when testing out our rate limiting and error handling checks, which should’ve bypassed the relatively-costly location matching, every rewrite rule got run twice, which made the performance pathologically worse!

So here’s a PSA of sorts:

Try not to have rewrite rules outside of a location block
Prefer named routes for “jump”s or bypass internal redirects instead of normal URLs/paths. This would’ve avoided the second execution of the rewrite rules. Something like the below snippet:

error_page 404 @notfound;

location @notfound {
  try_files 404.html =500;
}

Example

Just to demonstrate this with an example, I’m going to use this configuration in nginx, which is deliberately close to what we had structurally:

events {  }

error_log   /dev/stderr notice;

http {
    include         /etc/nginx/mime.types;
    default_type    application/octet-stream;
    access_log      /dev/stdout combined;
    rewrite_log     on;

    # The 404 page definition is copied verbatim from the example config that
    # every debian Nginx package ships with.
    #
    # Culprit 1:
    error_page 404 /404.html;
    error_page 403 /403.html;

    server {
        listen      80;
        server_name localhost;
        root /usr/share/nginx/html;

        # Culprit 2: Naked rewrite rules that try to match for every route, even
        # internal redirects
        rewrite /unknown1/(.*)  /unknown/$1;
        rewrite /unknown2/(.*)  /unknown/$1;
        rewrite /unknown3/(.*)  /unknown/$1;
        rewrite /unknown4/(.*)  /unknown/$1;

        # This will try to send out the file /usr/share/nginx/html/main.html or
        # else respond with a 404 error page.
        location = /main {
            try_files index.html /index.html =404;
        }

        location /notauthorized {
            return 403;
        }

        location /nonexistent {
            return 404;
        }

        # Only the 50x.html template is present in the latest nginx container,
        # so using that as a generic error page to keep things simple.
        location /404.html { try_files 50x.html /50x.html =500; }
        location /403.html { try_files 50x.html /50x.html =500; }

        # Always go to /main by issuing an internal rewrite
        location / {
            rewrite ^/.*$ /main;
        }
    }
}

This sets up three main user-facing routes: /, /main, /notauthorized, /nonauthorized. / redirects to /main internally, although the user won’t see any 3xx, while /notauthorized returns a 403 response. The latter too (return, as used in this case) is implemented as an internal redirect within nginx, so the routing and rule execution behaviour is going to be similar between the 404 case and the 403 case. For the uninitiated, try_files (as used here) in nginx checks the paths given to it within the root path, or else return the status code mentioned at the end. error_page allow for configuring extra routes when nginx has to respond to a particular status code. Effectively, this too is an internal redirect before and after: when a location block has the redirect rule, and when the redirect rule itself has a path as the target location.

I’ll try the following four routes:

curl localhost/
curl localhost/main
curl localhost/notauthorized
curl localhost/nonexistent

The / and /main runs are just to demonstrate the extra rewrite between them. rewrite_log on; does what it says on the tin, and here’s a filtered snippet from the logs:

GET /
"/unknown1/(.*)" does not match "/"
"/unknown2/(.*)" does not match "/"
"/unknown3/(.*)" does not match "/"
"/unknown4/(.*)" does not match "/"
"^/.*$" matches "/"
rewritten data: "/main", args: ""

GET /main
"/unknown1/(.*)" does not match "/main"
"/unknown2/(.*)" does not match "/main"
"/unknown3/(.*)" does not match "/main"
"/unknown4/(.*)" does not match "/main"

GET /notauthorized
"/unknown1/(.*)" does not match "/notauthorized"
"/unknown2/(.*)" does not match "/notauthorized"
"/unknown3/(.*)" does not match "/notauthorized"
"/unknown4/(.*)" does not match "/notauthorized"
"/unknown1/(.*)" does not match "/403.html"
"/unknown2/(.*)" does not match "/403.html"
"/unknown3/(.*)" does not match "/403.html"
"/unknown4/(.*)" does not match "/403.html"

GET /nonexistent
"/unknown1/(.*)" does not match "/nonexistent"
"/unknown2/(.*)" does not match "/nonexistent"
"/unknown3/(.*)" does not match "/nonexistent"
"/unknown4/(.*)" does not match "/nonexistent"
"/unknown1/(.*)" does not match "/404.html"
"/unknown2/(.*)" does not match "/404.html"
"/unknown3/(.*)" does not match "/404.html"
"/unknown4/(.*)" does not match "/404.html"

Both the / route and /main work as expected: the naked rewrite rules run once, but in the cases of the other two these get executed twice. With the current config it’s a bit hard to demonstrate, but the pathological case happens even when those 403, 404 cases happen naturally: an undefined location etc. Using named routes this is the rewritten (no pun intended) config:

events {  }

error_log   /dev/stderr notice;

http {
    include         /etc/nginx/mime.types;
    default_type    application/octet-stream;
    access_log      /dev/stdout combined;
    rewrite_log     on;

    error_page 404 @404.html;
    error_page 403 @403.html;

    server {
        listen      80;
        server_name localhost;
        root /usr/share/nginx/html;

        # Culprit 2: Naked rewrite rules that try to match for every route, even
        # internal redirects
        rewrite /unknown1/(.*)  /unknown/$1;
        rewrite /unknown2/(.*)  /unknown/$1;
        rewrite /unknown3/(.*)  /unknown/$1;
        rewrite /unknown4/(.*)  /unknown/$1;

        # This will try to send out the file /usr/share/nginx/html/main.html or
        # else respond with a 404 error page.
        location = /main {
            try_files index.html /index.html =404;
        }

        location /notauthorized {
            return 403;
        }

        location /nonexistent {
            return 404;
        }

        # Only the 50x.html template is present in the latest nginx container,
        # so using that as a generic error page to keep things simple.
        location @404.html { try_files 50x.html /50x.html =500; }
        location @403.html { try_files 50x.html /50x.html =500; }

        # Always go to /main by issuing an internal rewrite
        location / {
            rewrite ^/.*$ /main;
        }
    }
}

GET /
"/unknown1/(.*)" does not match "/"
"/unknown2/(.*)" does not match "/"
"/unknown3/(.*)" does not match "/"
"/unknown4/(.*)" does not match "/"
"^/.*$" matches "/"
rewritten data: "/main", args: ""

GET /main
"/unknown1/(.*)" does not match "/main"
"/unknown2/(.*)" does not match "/main"
"/unknown3/(.*)" does not match "/main"
"/unknown4/(.*)" does not match "/main"

GET /notauthorized
"/unknown1/(.*)" does not match "/notauthorized"
"/unknown2/(.*)" does not match "/notauthorized"
"/unknown3/(.*)" does not match "/notauthorized"
"/unknown4/(.*)" does not match "/notauthorized"

GET /nonexistent
"/unknown1/(.*)" does not match "/nonexistent"
"/unknown2/(.*)" does not match "/nonexistent"
"/unknown3/(.*)" does not match "/nonexistent"
"/unknown4/(.*)" does not match "/nonexistent"

As expected, only one set of rewrite rule runs. That said, the actual fix would be to refactor the rewrites into location blocks to improve the matching performance a little further.

Footnotes

Similar configuration is also shipped with the default debian package at least as of Debian 11, and the official Nginx container at least as of 1.27.0. ↩

Go can only read 1GiB per Read call

2024-02-07T00:00:00+05:30

UPDATE: I don’t mean to say that this is a bad choice, or that it’s a bug, or even a performance implication. It’s just a choice that was made which seemed a bit opaque without doing all the history spelunking I did here, and it’s interesting to see the reasoning behind it.

There’s a 1GiB limit for a single Read call for an os.File entity (object? struct?) in Go, even though native read syscall can fill a 2GiB buffer (as tested in my arm macos and Intel Linux machine). I ran into this when looking at a pprof profile of a sample word count program I was writing, which showed the program was spending way too much time in the syscall module. That in this context can only mean one thing: way too many read syscalls were getting called. Something like this would show this behaviour:

f, err := os.Open("superlargefile.txt")
if err != nil {
    log.Fatal("error opening input file: ", err)
}
defer f.Close()

buf := make([]byte, 1024*1024*1024*2) // 2GiB buffer
fmt.Println("buffer size", len(buf))

for iter := 1; ; iter += 1 {
    n, err := f.Read(buf)

    if err != nil {
        if err == io.EOF {
            fmt.Println("done")
            break
        }

        log.Fatal("error reading input file: ", err)
    }

    fmt.Println("bytes read: ", n)
    fmt.Println("iter: ", iter)
}

That, on a 2.5G file would output something like:

buffer size 2147483648
bytes read:  1073741824
iter:  1
bytes read:  1073741824
iter:  2
bytes read:  490442752
iter:  3
done

Even though the initialised buffer size is 2GiB, only 1GiB is read into the buffer per iteration. Upon digging into the source code, it looks like this is a deliberate choice. The main change logs from the history point to the following:

https://codereview.appspot.com/89900044 as a fix for golang/go#7812. This had a fix for failing reads on file sizes greater than or equal to 2GiB on macos and freebsd by capping each read syscall to only read a 2GiB-1 bytes. For the rest of operating systems, at this point, there was no cap.
https://codereview.appspot.com/94070044 as a followup of 1, where the limit was decreased without any OS checks to 1GiB, with an explanation that at least it would allow for aligned reads from disk, as opposed to an odd number that might miss page caches (my understanding).

Note that a lot has changed since that changeset, and the current file reference for that _unix.go file in the changeset is src/internal/poll/fd_unix.go.

Aside: System limits

As per the linux read syscall documentation, the maximum bytes that can be transferred is 2GiB. And I tested this out with rudimentary scripts in Rust and C. The Rust program is taken verbatim from the example for read_to_end(). Running that under strace has the following output (truncated here):

read(3, ..., 6594816000) = 2147479552
read(3, ..., 4447336448) = 2147479552
read(3, ..., 2299856896) = 2147479552
read(3, ..., 152377344) = 152377344
read(3, "", 32)         = 0

And a similar, simple C program results in similar output, when using the read syscall in a loop until the file is read:

SSIZE_MAX: 9223372036854775807 # outputting the limits.h constant
bytes read: 2147479552
bytes read: 2147479552
bytes read: 2147479552
bytes read: 152377344

Although that’s neither here nor there, it’s still interesting that Go’s choice has been to pick 2GiB-1 and then 1GiB justifying the odd buffer size in the former.

classnames library composes well!

2023-05-02T00:00:00+05:30

This is an unpublished draft from 6 years ago. Unpublished until now, that is.

The classnames library is a very handy tool to apply CSS classes conditionally in JavaScript components. Since the output of the function is just a string, this can be composed very well on multiple conditionals layered on on various parts of the code.

For example, consider the following:

import cx from 'classnames';

switch (type) {
  case: 'textarea':
    const textareaClassNames = cx('text-area', 'text-input', 'invalid': !this.state.valid);
    return <textarea className={textareaClassNames} />
  default:
    const inputClassNames = cx('text-input', 'invalid': !this.state.valid);
    return <input type={type} className={inputClassNames} />
}

The class-names are the same except for one extra item in the case of textarea type input field. Until today, I would’ve done something like the above example, since I never bothered to look at the actual output of the call. A quick glance at the source code of the library made it evident that the library would enable composition with output of another classname-generated value (which is a String). So that code can be simplified to:

import cx from 'classnames';
const className = cx('text-input', 'invalid': !this.state.valid);

switch (type) {
  case: 'textarea':
    return <textarea className={cx(className, 'text-area')} />
  default:
    return <input type={type} className={className} />
}

Much better.

Node has native CLI argument parsing

2023-02-09T00:00:00+05:30

I knew this was in the works, but wasn’t aware this was shipped with v16! (released in 2022). I was playing with TypeScript code transforms and wanted to update the source file after the transformation based on a flag. The script was basically standalone, so I didn’t want to depend on any external depedencies like argparse. The API I was aiming at was basically:

node enum-to-const-object.mjs source.ts [...]
node enum-to-const-object.mjs -w source.ts [...]
node enum-to-const-object.mjs --write source.ts [...]

The first invocation would print out the result to standard out, whereas the latter two would update the source file in-place, exactly how prettier works. The standard library API is pretty neat for such a simple interface:

// file: enum-to-const-object.mjs
// Note the mjs extension, which is why I'm able to use import. Otherwise,
// you'll have to use require in place of the following line
import { parseArgs } from 'node:util';

const options = {
  write: {
    type: 'boolean',
    short: 'w',
    default: false
  }
};

const { values, positionals } = parseArgs({ options, allowPositionals: true });
// values is of the shape { write:  }
// positionals: [ source.ts, ... ]

The options object is the one used by the parser as the configuration of the flags. The keys of this object are the expected flags in long-hand. The short property for each of these long-hand flags helps with adding aliases.

In addition to the flag format and strings, there is one more additional option that I had to configure: allowPositionals. This returns rest of the arguments that are not flags, which in my case are the files I wanted to transform. Once parseArgs is called using the configuration, and (by default) on process.argv, the flag values as an key-value pair, and the rest of the arguments are returned―values which contains the flag values, and positionals which contains the file list.

Docs Link

Using CSP in report-only and enforcement mode

2023-01-17T00:00:00+05:30

I recently came across this strategy which uses the standardised Content Security Policy for both enforcement and script monitoring on a web page for security. We use CSP for enforcement already, but I was under the assumption that report-only mode and enforcement mode are exclusive. That is, if the Content-Security-Policy header was used with a few rules, I thought the Content-Security-Policy-Report-Only can’t be used; or perhaps it doesn’t work if we send both. But, in hindsight, this was wrong. For example, let’s say a page returns the following headers, and there’s an tag on this page trying to load images from example.com.

Content-Security-Policy: default-src 'self' images.kgrz.io; report-uri: /report-block
Content-Security-Policy-Report-Only: default-src 'self'; report-uri: /report-only

This CSP setting ensures resources only from images.kgrz.io are successfully loaded onto the page. I had always had the implicit understanding that the image load will be blocked, and a report sent out to /report-block path. But this is not the case: there’ll be two reports sent-out: one to /report-block, and one to /report-only.

The sample application that demonstrates this example is hosted at /apps/csp. You may have to have a modern-ish website to use this since I’m using no build pipeline for the JS that’s used on the page. (Anything that [supports