3D Rendering in Software

This is my work to build a software 3D renderer as guided by the "Learn Computer Graphics Programming" course by Gustavo Pezzi.

My completion certificate

Progress

Here's where I take videos of significant milestones to look back on the progress I've made.

01. Simple Perspective Cube Points

A cube made up of a cloud of points with simple perspective projection applied.

01.Simple-Perspective-Cube-Points.mp4

01.Simple-Perspective-Cube-Points.mp4

02. Simple Rotation Transformation

Rotating the cube with simple rotation transformations.

02.Simple-Vector-Rotation-Transformation.mp4

02.Simple-Vector-Rotation-Transformation.mp4

03. Wireframe Cube

The cube is now expressed as a collection of triangle faces and is rendered as a wireframe using a simple line rasterization algorithm.

03.Wireframe-Cube.mp4

03.Wireframe-Cube.mp4

04. Rendering OBJ Files

Instead of a static cube, the renderer can now read in arbitrary OBJ files to render.

04.Render-OBJ-File.mp4

04.Render-OBJ-File.mp4

05. Back Face Culling

Implemented a bunch of vector functions and used them to implement back-face culling; mesh faces that aren't visible by the camera are no longer rendered.

05.Back-Face-Culling.mp4

05.Back-Face-Culling.mp4

06. Triangle Rasterization

Triangles are now filled in with a set of static colors. Render modes can be changed at runtime between wireframe and rasterized, plus an option for showing dots on vertices and enabling/disabling back-face culling.

06.Triangle-Rasterization.mp4

06.Triangle-Rasterization.mp4

07. Face Depth Sorting

With a naive sorting algorithm, faces are rasterized in the correct depth order, preventing back faces from "bleeding through."

07.Face-Depth-Sorting.mp4

07.Face-Depth-Sorting.mp4

08. Transformation Matrices

Use rotation, scaling, and translation matrices to apply transformations to mesh vertices.

08.Transformation-Matrices.mp4

08.Transformation-Matrices.mp4

09. Flat Shading with Global Lighting

Simple lighting appearance by shading each face in relation to a global light source.

09.Flat-Shading-Global-Lighting.mp4

09.Flat-Shading-Global-Lighting.mp4

10. Texture UV Mapping

Textures are mapped onto triangle faces using barycentric weighting.

10.Texture-UV-Mapping.mp4

10.Texture-UV-Mapping.mp4

11. Perspective Correct UV Interpolation

Textures are mapped with perspective-corrected barycentric weighting.

11.Perspective-Correct-Interpolation.mp4

11.Perspective-Correct-Interpolation.mp4

12. Loading Arbitrary Textures and Texture Maps

Textures can be loaded from PNG files and mapped using UV coordinates from associated OBJ files.

12.Obj-Texture-Loading.mp4

12.Obj-Texture-Loading.mp4

13. Z-Buffer

Z-Buffer is used to determine which pixels are rendered on top, reducing glitching of triangles popping on top of others.

13.Z-Buffer.mp4

13.Z-Buffer.mp4

14. Simple Camera

A simple camera that can rotate left/right and translate up/down/forward/back.

14.Simple-Camera.mp4

14.Simple-Camera.mp4

15. Frustum Clipping

Meshes are now clipped against the edges of the camera frustum, adjusting triangle and texture coordinates to only draw what is visible.

15.Frustum-Clipping.mp4

15.Frustum-Clipping.mp4

16. Camera Adjustments

Updated input processing to smooth out camera movement. Reduced resolution to achieve a retro look with higher frame rates.

16.Camera-Adjustments.mp4

16.Camera-Adjustments.mp4

17. Multiple Meshes

Multiple meshes can now be rendered in the scene.

17.Multiple-Meshes.mp4

17.Multiple-Meshes.mp4

Topics To Review

Concepts that I still lack some total understanding of:

Coordinate system handedness and how it affects operations such as cross product
Perspective correct interpolation

Additional Learning Resources

Subpixel Rendering

Notes

Perspective Projection

The two triangles that make up the viewer's angle to the screen-space projected point and the 3D point are similar triangles, which share a constant ratio $\frac{BC}{DE} = \frac{AB}{AD}$ as illustrated in the top-down view of the point's X axis below.

This can be simplified:

$$ \frac{P'_x}{P_x}=\frac{1}{P_z} $$

$$ P'_x=\frac{P_x}{P_z} $$

Note $P_z$ in the denominator, indicating that the projected X is scaled by the Z distance to the point. This is called the perspective divide.

Similarly, the Y perspective projection:

$$ P'_y=\frac{P_y}{P_z} $$

Left vs Right-handed Coordinate Systems

The "handedness" of the coordinate system defines how different dimensional axis are interpreted.

"Left-handed" coordinate systems define Z values as growing "into" the screen, away from the viewer.
"Right-handed" coordinate systems define Z values as growing "out of" the screen, toward the viewer.

DirectX uses a left-handed coordinate system, while OpenGL uses a right-handed coordinate system.

Illustration from https://www.oreilly.com/library/view/learn-arcore/9781788830409

A separate but related convention is triangle "winding order," or the order in which the vertices around the edge of a triangle are traversed. The winding order can be clockwise, or counter-clockwise.

The coordinate system and winding order convention can determine how normal values should be calculated.

There's additional context in this lesson video.

Left Hand Rule

Given a triangle with points $A$, $B$, $C$, point your index finger in the direction of the first vector $(B - A)$ and your middle finger in the direction of the second vector $(C - A)$. Your thumb will be pointing in the direction of the normal vector.

Right Hand Rule

Use the same method as above, but with the right hand. Notice that the normal direction is inverted given the same triangle.

Physicists normally use the right hand rule.

Transformation

Rotation

Applying a rotation to a 2D vector using trigonometry functions

Goal: Rotate a 2D vector $(x, y)$ around the origin by angle $\beta$ to get $(x', y')$.

Let $\alpha$ represent the angle from the origin to $(x, y)$.

Let $r$ represent the hypotenuse of the triangle, or radius of the rotation.

Since $\cos(\alpha) = \frac{x}{r}$, $x = r \times \cos(\alpha)$

Since $\sin(\alpha) = \frac{y}{r}$, $y = r \times \sin(\alpha)$

After applying the angle $\beta$, the new coordinates can be calculated:

$\cos(\alpha + \beta) = \frac{x'}{r}$, so $x' = r \times \cos(\alpha + \beta)$

$\sin(\alpha + \beta) = \frac{y'}{r}$, so $y' = r \times \sin(\alpha + \beta)$

Trig functions that add two values can be expanded using the angle addition formula:

$$ x' = r\cos(\alpha + \beta) $$

$$ x' = r(\cos{\alpha} \cos{\beta} - \sin{\alpha} \sin{\beta}) $$

$$ x' = r \cos{\alpha} \cos{\beta} - r \sin{\alpha} \sin{\beta} $$

You can substitute $r \cos{\alpha}$ with $x$, and $r \sin{\alpha}$ with $y$:

$$ x' = x \cos{\beta} - y \sin{\beta} $$

Similarly,

$$ y' = r \sin(\alpha + \beta) $$

$$ y' = r(\sin{\alpha} \cos{\beta} + \cos{\alpha} \sin{\beta}) $$

$$ y' = r \sin{\alpha} \cos{\beta} + r \cos{\alpha} \sin{\beta} $$

$$ y' = y \cos{\beta} + x \sin{\beta} $$

These are the formulas that are used by a rotation transformation matrix.

The same principle applies to 3 dimensions, but with one dimension at a time:

vec3_t Vec3RotateX(vec3_t v, float angle)
{
    vec3_t rotated_vector = {
        .x = v.x,
        .y = v.y * cosf(angle) - v.z * sinf(angle),
        .z = v.y * sinf(angle) + v.z * cosf(angle),
    };
    return rotated_vector;
}

vec3_t Vec3RotateY(vec3_t v, float angle)
{
    vec3_t rotated_vector = {
        .x = v.x * cosf(angle) - v.z * sinf(angle),
        .y = v.y,
        .z = v.x * sinf(angle) + v.z * cosf(angle),
    };
    return rotated_vector;
}

vec3_t Vec3RotateZ(vec3_t v, float angle)
{
    vec3_t rotated_vector = {
        .x = v.x * cosf(angle) - v.y * sinf(angle),
        .y = v.x * sinf(angle) + v.y * cosf(angle),
        .z = v.z,
    };
    return rotated_vector;
}

Vector Math Review

Vector Magnitude

Magnitude refers to the length of the vector:

$$ |\vec{v}| $$

$$ |\vec{v}| = \sqrt{{v_x}^2 + {v_y}^2} $$

Vector Addition

Adding vectors is basically starting one vector from the end of the other:

$$ \vec{a} + \vec{b} = (a_x + b_x, a_y + b_y) $$

Vector Subtraction

Subtraction is the same as addition, but invert/negate the second vector:

$$ \vec{a} - \vec{b} = (a_x - b_x, a_y - b_y) $$

Cross Product

The cross product helps to calculate the normal vector of a plane.

The cross product of two vectors $\vec{x} \times \vec{y}$ yields a vector that is perpendicular to both of those vectors.

To calculate the cross product:

$$ \vec{N} = \vec{a} \times \vec{b} $$

$$ N_x = a_y b_z - a_z b_y $$

$$ N_y = a_z b_x - a_x b_z $$

$$ N_z = a_x b_y - a_y b_x $$

There are two possible perpendicular vectors for any given pair of vectors. The order of operands will determine which direction is calculated.

The magnitude of the cross product is related to the angle between the two input vectors:

$$ |\vec{a} \times \vec{b}| = |a| |b| \sin{\theta} $$

Resource for additional information on how to derive the cross product

Dot Product

The dot product of two vectors produces a scalar value of the sum of the components of each given vector multiplied together.

$$ \vec{a} \cdot \vec{b} = {a_x}{b_x} + {a_y}{b_y} $$

When used with unit vectors, the dot product can be used to produce a "projection" of one vector onto the other.

The more "aligned" the vectors are, the larger the dot product is. If they are exactly the same, the dot product is $1$.

At a 90 degree offset, the dot product is $0$.

If the two vectors are complete opposites, the dot product is $-1$.

Normalizing Vectors

A normalized vector is a vector with a magnitude of 1.

$$ \hat{a} = \frac{\vec{a}}{|\vec{a}|} $$

If you don't care about the length of a vector, it's often better to express it as a normalized vector.

Back Face Culling

If we'd like to avoid rendering faces that are facing away from the camera, we can simply compare their normal vector to the vector of the camera.

Here's how we can get the normal vector of a triangle face:

Note that we take our vertices in clockwise order, consistent with our chosen coordinate system.

Once we have the normal vector, we can compare it to the camera ray vector using the dot product to determine if the face is facing toward the camera or away from it.

To find the camera ray vector, we simply subtract the camera position from the point we are observing.

Matrices

Just a way of expressing and manipulating a set of values in rows and columns.

$$ M = \begin{bmatrix} -2 & 5 & 6 \\ 5 & 2 & 7 \end{bmatrix} $$

Matrix $M$ has 2 rows and 3 columns; the dimensions are $2 \times 3$.

A matrix has a set of elements that can be referenced as followed:

$$ M_{3 \times 2} = \begin{bmatrix} m_{11} & m_{12} \\ m_{21} & m_{22} \\ m_{31} & m_{32} \end{bmatrix} $$

Matrices are useful for solving systems of equations:

$$ \left \lbrace \begin{alignedat}{3} x + 2y -4z = 5 \\ 2x + y - 6z = 8 \\ 4x - y - 12z = 13 \end{alignedat} \right. $$

$$ \begin{bmatrix} 1 & 2 & -4 & 5 \\ 2 & 1 & -6 & 8 \\ 4 & -1 & -12 & 13 \end{bmatrix} $$

In computer graphics, matrices are useful in converting sets of geometric data into different coordinate systems. They can be used to apply translation, rotation, projection, and many other transformations.

Matrix Operations

Matrix Addition

Simply add each element together.

$$ \begin{bmatrix} 2 & 3 \\ 1 & -5 \end{bmatrix} + \begin{bmatrix} 3 & 1 \\ 1 & 2 \end{bmatrix} = \begin{bmatrix} 5 & 4 \\ 2 & -3 \end{bmatrix} $$

Matrix Subtraction

Simply subtract each element from each other.

$$ \begin{bmatrix} 2 & 3 \\ 1 & -5 \end{bmatrix} - \begin{bmatrix} 3 & 1 \\ 1 & 2 \end{bmatrix} = \begin{bmatrix} -1 & 2 \\ 0 & -7 \end{bmatrix} $$

Matrix Multiplication

Matrix multiplication is more complex. For each combination of row and column you must multiply the row elements with the column elements and sum the results:

$$ \begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix} * \begin{bmatrix} 5 & 6 \\ 7 & 8 \end{bmatrix} = \begin{bmatrix} (1 \ast 5) + (2 \ast 7) & (1 \ast 6) + (2 \ast 8) \\ (3 \ast 5) + (4 \ast 7) & (3 \ast 6) + (4 \ast 8) \end{bmatrix} = \begin{bmatrix} 19 & 22 \\ 43 & 50 \end{bmatrix} $$

Multiplication is only possible when the number of columns on the left matrix is equal to the number of rows on the right matrix.

The dimension of the resulting matrix will have the number of rows of the left matrix and the number of columns of the right matrix.

$$ M_{N \times M} * M_{M \times P} = M_{N \times P} $$

Matrix multiplication is not commutative:

$$ A * B \neq B * A $$

Identity Matrix

A square matrix with 1's in the diagonal and 0's everywhere else.

Any matrix multiplied by the identity matrix will return an unchanged result.

$$ \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} $$

Simple Rotation Matrix

Earlier, we determined that you can calculate the new $x$ and $y$ positions for a given rotation $\alpha$ using simple trigonometry functions:

$$ x' = x \cos{\alpha} - y \sin{\alpha} $$

$$ y' = y \cos{\alpha} + x \sin{\alpha} $$

This can be represented in matrix form:

$$ \begin{bmatrix} x' \\ y' \end{bmatrix} = \begin{bmatrix} \cos{\alpha} & -\sin{\alpha} \\ \sin{\alpha} & \cos{\alpha} \end{bmatrix} * \begin{bmatrix} x \\ y \end{bmatrix} = \begin{bmatrix} x\cos{\alpha} - y\sin{\alpha} \\ x\sin{\alpha} + y\cos{\alpha} \end{bmatrix} $$

This matrix is called a 2D rotation matrix:

$$ R = \begin{bmatrix} \cos{\theta} & -\sin{\theta} \\ \sin{\theta} & \cos{\theta} \end{bmatrix} $$

When it is multiplied against a set of coordinates, it produces a set of transformed coordinates rotated by $\theta$.

Transformation Matrices

In linear algebra, linear transformations can be represented by matrices.

$$ \begin{bmatrix} m & m & m & m \\ m & m & m & m \\ m & m & m & m \\ m & m & m & m \end{bmatrix} * \begin{bmatrix} x \\ y \\ z \\ w \end{bmatrix} $$

4x4 matrices are usually used to represent 3D transformations (scale, translation, rotation, etc.)

We use 4x4 instead of 3x3 because some transformations (ex. translation) require an extra row/column.

To enable multiplication, an extra component $w$ is added to our original vector.

Scale Matrix

$$ \begin{bmatrix} sx & 0 & 0 & 0 \\ 0 & sy & 0 & 0 \\ 0 & 0 & sz & 0 \\ 0 & 0 & 0 & 1 \\ \end{bmatrix} * \begin{bmatrix} x \\ y \\ z \\ 1 \end{bmatrix} $$

Performing this multiplication yields the following:

$$ \begin{bmatrix} (sx * x) + 0 + 0 + 0 \\ 0 + (sy * y) + 0 + 0 \\ 0 + 0 + (sz * z) + 0 \\ 0 + 0 + 0 + (1 * 1) \\ \end{bmatrix} = \begin{bmatrix} sx * x \\ sy * y \\ sz * z \\ 1 \end{bmatrix} $$

Translation Matrix

$$ \begin{bmatrix} 1 & 0 & 0 & tx \\ 0 & 1 & 0 & ty \\ 0 & 0 & 1 & tz \\ 0 & 0 & 0 & 1 \end{bmatrix} * \begin{bmatrix} x \\ y \\ z \\ 1 \end{bmatrix} $$

Performing the multiplication yields the following:

$$ \begin{bmatrix} x + 0 + 0 + tx \\ 0 + y + 0 + ty \\ 0 + 0 + z + tz \\ 0 + 0 + 0 + 1 \end{bmatrix} = \begin{bmatrix} x + tx \\ y + ty \\ z + tz \\ 1 \end{bmatrix} $$

Rotation Matrix

These are defined in a left-handed coordinate system, such that each axis is rotated counter-clockwise around its axis. See direction.

The rotation matrix for the X axis looks like this:

$$ \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & \cos{\alpha} & \sin{\alpha} & 0 \\ 0 & -\sin{\alpha} & \cos{\alpha} & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} * \begin{bmatrix} x \\ y \\ z \\ 1 \end{bmatrix} $$

The rotation matrix for the Y axis looks like this:

$$ \begin{bmatrix} \cos{\alpha} & 0 & -\sin{\alpha} & 0 \\ 0 & 1 & 0 & 0 \\ \sin{\alpha} & 0 & \cos{\alpha} & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} * \begin{bmatrix} x \\ y \\ z \\ 1 \end{bmatrix} $$

The rotation matrix for the Z axis looks like this:

$$ \begin{bmatrix} \cos{\alpha} & \sin{\alpha} & 0 & 0 \\ -\sin{\alpha} & \cos{\alpha} & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix} * \begin{bmatrix} x \\ y \\ z \\ 1 \end{bmatrix} $$

World Matrix

By combining translation, rotation, and scaling matrices via matrix multiplication, we can express the location of an object in the world with a single matrix.

$$ M_{world} = M_{translation} * M_{rotation} * M_{scale} $$

$$\vec{v}_{vertex} = M_{world} * \begin{bmatrix} x \\\ y \\\ z \\\ 1 \end{bmatrix}$$

Order of Transformations

The order of transformations matters. The usual order is:

Scale
Rotate
Translate

If these are performed out-of-order, it may result in unexpected values. For example, if translation is applied before rotation such that the object has moved away from the origin (0, 0), then rotation will be still be applied around the origin of (0, 0), exaggerating the result of the rotation transformation.

Perspective Projection Matrix

You can also use matrices to achieve projection of points onto a plane.

Projection Matrices handle:

Aspect ratio: adjust x and y values based on screen width and height
Field of view: adjust x and y values based on FOV angle
Normalization: adjust x, y, and z values to sit between -1 and 1

The aspect ratio of the height vs width of the screen.

$$ a = \frac{h}{w} $$

The field of view is defined as the 'scale factor' for how points should be adjusted to fit within the given FOV angle.

$$ f = \frac{1}{\tan(\theta / 2)} $$

Normalizing Z

We must also normalize z to a 'normalized device coordinate' between 0 and 1.

We do this by defining two planes; $zfar$ and $znear$. We can then define a scaling factor to adjust values with respect to these two planes.

$$ \lambda = \frac{zfar}{zfar - znear} - \frac{zfar * znear}{zfar - znear} $$

We can use values $a$, $f$, $\lambda$ from above to convert world points $\begin{bmatrix}x \\ y \\ z \end{bmatrix}$ to screen space $\begin{bmatrix} a f x \\ f y \\ \lambda z - \lambda znear \end{bmatrix}$.

Deriving the Projection Matrix

We can substitute in the values we defined above:

$$ \begin{bmatrix} a f x \\ f y \\ \lambda z - \lambda znear \end{bmatrix} = \begin{bmatrix} (\frac{h}{w})(\frac{1}{\tan(\theta/2)}) * x \\ (\frac{1}{\tan(\theta/2)}) * y \\ (\frac{zfar}{zfar - znear}) * z - (\frac{zfar}{zfar - znear}) * znear \end{bmatrix} $$

To apply this using matrix multiplication, we can use a matrix like the following:

$$ \begin{bmatrix} (\frac{h}{w})(\frac{1}{\tan(\theta/2)}) & 0 & 0 & 0 \\ 0 & (\frac{1}{\tan(\theta/2)}) & 0 & 0 \\ 0 & 0 & (\frac{zfar}{zfar - znear}) & -(\frac{zfar}{zfar - znear}) * znear \\ 0 & 0 & 1 & 0 \end{bmatrix} $$

Note the $1$ in position $M_{4, 3}$. This is used to maintain the original un-normalized $z$ value in the 4th dimension $w$ in order to later perform operations such as perspective divide.

This note has additional explanation for the values contained within the projection matrix.

Column-Major vs Row-Major

In the examples above, vertices are represented in "column-major" order:

$$ \begin{bmatrix} x \\ y \\ z \\ w \end{bmatrix} $$

An alternative representation is "row-major":

$$ \begin{bmatrix} x & y & z & w \end{bmatrix} $$

Different graphics APIs may choose to use different representations for a variety of reasons.

One implication to row-major vs column-major is the order of operands for matrix multiplication. Vertices defined in row-major format are "post-multiplied" against a projection matrix:

$$\vec{p}_{projected} = \vec{p} * M_{scale} * M_{rotate} * M_{translate}$$

Where as column-major vertices are "pre-multiplied":

$$\vec{p}_{projected} = M_{translate} * M_{rotate} * M_{scale} * \vec{p}$$

This note has more details.

Flat Shading with Global Illumination

Simple lighting can be achieved by implementing one global light with a direction vector. The direction vector can be compared against each face normal via dot product to determine a lighting intensity.

Texture Mapping

Texture coordinates are represented as $(u, v)$ simply to differentiate from the normal $x, y$ values.

UV mapping is the process of mapping the vertices of a face to positions on a texture.

Interpolating with Barycentric Coordinates

Barycentric coordinates are like applying a set of weight values on vertices to decide where a point is located in the middle of a triangle face.

$$P = (\alpha, \beta, \gamma)$$

These 'weight values' also represent the areas of the three sub-triangles made by the point $P$

The sum of the barycentric weights is always $1$.

$$\alpha + \beta + \gamma = 1$$

The weights 'pull' the vertices to result in coordinate $P$.

$$\vec{P} = \alpha \vec{A} + \beta \vec{B} + \gamma \vec{C}$$

Finding Barycentric Coordinates of a Point

Given the triangle and point $P$ defined in the illustration above...

$$\alpha = \frac{area\_triangle(PBC)}{area\_triangle(ABC)}$$

To calculate the area of the triangle $PBC$, treat it like a parallelogram. This allows us to easily calculate the area using the cross product. Use the same approach to determine the area of $ABC$.

$$\alpha = \frac{area\_parallelogram(PBEC)}{area\_parallelogram(ABDC)}$$

$$\alpha = \frac{PC \times PB}{AC \times AB}$$

$$\beta = \frac{AC \times AP}{AC \times AB}$$

$$\gamma = 1.0 - \alpha - \beta$$

Ensure that the order of the cross product matches the coordinate system in use, left-handed (clockwise) in this case.

Perspective Correct Texture Mapping

The straight mapping we achieved so far is called "affine mapping." It does not take into account perspective.

To find depth values, you cannot simply interpolate $z$ values using barycentric weighting because the depth value is not linear across the triangle.

However, the reciprocal of the Z components is linear. So we can use $\frac{1}{z}$ to find nthe interpolated $z$ at point $P$.

The original $z$ value of each vertex is stored in the $w$ value of the vector, so that should be used in place of $z$.

To achieve perspective correct mapping, we:

Use the reciprocal of all attributes ($\frac{1}{w}$) (now linear in screen space)
Inerpolate over the triangle face (using barycentric weights, $\frac{1}{w}$ factor)
Divide all attributes by $\frac{1}{w}$ (undoes the perspective transform).

There is a good academic paper describing the derivation of perspective correct interpolation here.

Additional resources:

Z-Buffer

Can also be called the depth buffer.

This stores the depth of each screen pixel in an array, and helps determine which pixel is "in front."

Alternative to painter's algorithm.

As explained in perspective-correct texture interpolation, the Z depth is not linear in screen space across the surface of the triangle. Instead, like texture mapping, the reciprocal $\frac{1}{w}$ is used instead.

Camera

Or a "view matrix" is used to transform the 3D scene into a perspective from a camera or view.

One approach to doing this is implementing a "look at" function that returns a matrix that can transform world vertices into camera space from a certain point looking at another point.

The matrix will:

Translate the whole scene inversely from the camera eye position to the origin (matrix $M_T$)
Rotating the scene with reverse orientation (matrix $M_R$) so the camera is positioned at the origin and facing the positive Z axis (since our renderer is left-handed).

$$M_{view} = M_R * M_T = \begin{bmatrix} r_{11} & r_{12} & r_{13} & 0 \\\ r_{21} & r_{22} & r_{23} & 0 \\\ r_{31} & r_{32} & r_{33} & 0 \\\ 0 & 0 & 0 & 1 \end{bmatrix} * \begin{bmatrix} 1 & 0 & 0 & t_x \\\ 0 & 1 & 0 & t_y \\\ 0 & 0 & 1 & t_z \\\ 0 & 0 & 0 & 1 \end{bmatrix}$$

$$= \begin{bmatrix} r_{11} & r_{12} & r_{13} & (r_{11}t_x + r_{12}t_y + r_{13}t_z) \\\ r_{21} & r_{22} & r_{23} & (r_{21}t_x + r_{22}t_y + r_{23}t_z) \\\ r_{31} & r_{32} & r_{33} & (r_{31}t_x + r_{32}t_y + r_{33}t_z) \\\ 0 & 0 & 0 & 1 \end{bmatrix}$$

The translation matrix will simply be the negated coordinates of the eye position:

$$M_T = \begin{bmatrix} 1 & 0 & 0 & -eye_x \\\ 0 & 1 & 0 & -eye_y \\\ 0 & 0 & 1 & -eye_z \\\ 0 & 0 & 0 & 1 \\\ \end{bmatrix}$$

For the rotation matrix, we need to compute the forward ($z$), right ($x$), and up ($y$) vectors.

$$\begin{bmatrix} x_x & y_x & z_x & 0 \\\ x_y & y_y & z_y & 0 \\\ x_z & y_z & z_z & 0 \\\ 0 & 0 & 0 & 1 \end{bmatrix}^{-1}$$

This matrix is used to convert between coordinate systems. Note, it must be inverted (since the scene must move 'around' the camera). An inverted matrix can be thought of like an "undo" of the original matrix.

For orthogonal matrices, inversion is a simple matter of transposing (flipping so that rows become columns and columns become rows).

$$\begin{bmatrix} x_x & y_x & z_x & 0 \\\ x_y & y_y & z_y & 0 \\\ x_z & y_z & z_z & 0 \\\ 0 & 0 & 0 & 1 \end{bmatrix}^{T} = \begin{bmatrix} x_x & x_y & x_z & 0 \\\ y_x & y_y & y_z & 0 \\\ z_x & z_y & z_z & 0 \\\ 0 & 0 & 0 & 1 \end{bmatrix}$$

Multiplying the rotation and transformation matrices with the values above:

$$M_{view} = M_R * M_T = \begin{bmatrix} x_x & x_y & x_z & 0 \\\ y_x & y_y & y_z & 0 \\\ z_x & z_y & z_z & 0 \\\ 0 & 0 & 0 & 1 \end{bmatrix} * \begin{bmatrix} 1 & 0 & 0 & -eye_x \\\ 0 & 1 & 0 & -eye_y \\\ 0 & 0 & 1 & -eye_z \\\ 0 & 0 & 0 & 1 \\\ \end{bmatrix}$$

$$= \begin{bmatrix} x_x & x_y & x_z & (-x_x eye_x - x_y eye_y -x_z eye_z) \\\ y_x & y_y & y_z & (-y_x eye_x - y_y eye_y -y_z eye_z) \\\ z_x & z_y & z_z & (-z_x eye_x - z_y eye_y -z_z eye_z) \\\ 0 & 0 & 0 & 1 \end{bmatrix}$$

The last column can be simplified using dot product:

$$= \begin{bmatrix} x_x & x_y & x_z & -dot(x, eye) \\\ y_x & y_y & y_z & -dot(y, eye) \\\ z_x & z_y & z_z & -dot(z, eye) \\\ 0 & 0 & 0 & 1 \end{bmatrix}$$

Clipping

Clipping is the process of removing objects or line segments that are outside the viewing volume.

For frustum clipping, six planes are used:

Top
Bottom
Left
Right
Near
Far

A plane is defined by a point $P$ and a normal vector $\vec{n}$.

Calculating the Frustum Planes

Notably, the camera origin point is present on every frustum plane. So it makes for a convenient starting point $P$.

To calculate the right frustum plane, simply draw the normal vector at 90 degrees from the right camera boundary vector ($\frac{fov}{2}$ from origin) and calculate the X and Y components of the normal vector using trig functions.

$$P = (0, 0, 0)$$

$$\vec{n}_x = -\cos{\frac{fov}{2}}$$

$$\vec{n}_y = 0$$

$$\vec{n}_z = \sin{\frac{fov}{2}}$$

A similar process can be used for the left, top, and bottom planes.

Right Frustum Plane

$$\vec{n} = ( -\cos{\frac{fov}{2}}, 0, \sin{\frac{fov}{2}} )$$

Left Frustum Plane

$$\vec{n} = ( \cos{\frac{fov}{2}}, 0, \sin{\frac{fov}{2}} )$$

Top Frustum Plane

$$\vec{n} = ( 0, -\cos{\frac{fov}{2}}, \sin{\frac{fov}{2}} )$$

Bottom Frustum Plane

$$\vec{n} = ( 0, \cos{\frac{fov}{2}}, \sin{\frac{fov}{2}} )$$

Near Frustum Plane

Unlike the other planes, the point $P$ is offset from camera origin. The normal vector is straight out along the $Z$ axis.

$$P = ( 0, 0, z_{near} )$$

$$\vec{n} = (0, 0, 1)$$

Far Frustum Plane

Similarly, for the far plane:

$$P = ( 0, 0, z_{far} )$$

$$\vec{n} = (0, 0, -1)$$

The negative $\vec{n}_z$ value is notable because it denotes which direction is considered "inside" the plane versus outside.

Determining Whether Points are Inside, Outside, or on a Plane

A point $Q$ will be "on" the plane if $(Q - P) \cdot \vec{n} = 0$

A point $Q$ will be "inside" the plane if $(Q - P) \cdot \vec{n} > 0$

A point $Q$ will be "outside" the plane if $(Q - P) \cdot \vec{n} < 0$

Determining Intersection of Lines with Planes

The linear interpolation equation allows us to calculate any point along a line:

$$I = Q_1 + t(Q_2 - Q_1)$$

The interpolation factor $t$ describes the amount $0 - 1$ along the line between points $Q_1$ and $Q_2$.

Given a plane that intersects the line between $Q_1$ and $Q_2$, we need to determine the value of $t$ that provides us with the intersection point.

You can use the dot product to determine each point's relationship with the plane:

$$dot_{Q_1} = \vec{n} \cdot (Q_1 - P)$$

$$dot_{Q_2} = \vec{n} \cdot (Q_2 - P)$$

We can use these along with the linear interpolation equation:

$$I = Q_1 + t(Q_2 - Q_1)$$

$$(I - P) = (Q_1 - P) + t((Q_2 - P) - (Q_1 - P))$$

An dot product each component with $\vec{n}$...

$$\vec{n} \cdot (I - P) = \vec{n} \cdot (Q_1 - P) + t(\vec{n} \cdot (Q_2 - P) - \vec{n} \cdot (Q_1 - P))$$

We know the value of $\vec{n} \cdot (I - P)$ is $0$ because it is on the plane. Simplifying:

$$0 = dot_{Q_1} + t(dot_{Q_2} - dot_{Q_1})$$

We need to isolate $t$ to determine the intersection point.

$$t = \frac{-dot{Q_1}}{dot_{Q_2} - dot_{Q_1}}$$

or

$$t = \frac{dot{Q_1}}{dot_{Q_1} - dot_{Q_2}}$$

How to Clip a Polygon

First, list each of the vertices along the boundary of the polygon and determine if each is inside or outside of the plane. The lines with vertices that straddle the boundary must be clipped, and the intersection point should be added to both lists.

Inside	Outside
$I_1$	$Q_1$
$Q_2$	$I_1$
$Q_3$	$I_2$
$Q_4$	$Q_5$
$I_2$

The resulting polygon is the set of points from the "inside" vertices list.

This operation needs to be repeated for each plane in the view frustum in order to achieve frustum space clipping.

Turning a Clipped Polygon Back Into Triangles

To turn a polygon back into a set of triangles, we can simply iterate through sets of 3 vertices like so:

for (i = 0; i < (num_vertices - 2); ++i) {
    index0 = 0;
    index1 = i + 1;
    index2 = i + 2;

    create_triangle(index0, index1, index2);
}

Clipping UV Coordinates

UV coordinates for triangles can be clipped via linear interpolation using same interpolation factor that produces the new vertices along the edges of the clipping plane.

Homogeneous Clipping

Usually, graphics pipelines will perform clipping after projection but before perspective divide. There are several advantages to doing this:

Perspective divide is where x, y, and z are divided by w. Thus, before perspective divide, every vertex that is inside the frustum is between $-1 * w$ and $1 * w$, making frustum culling as trivial as comparing each component against $w$.
Texture coordinates can still be interpolated linearly in this space, since the perspective divide has not happened yet.
Division by zero is avoided, since clipping and culling are against $z_{near}$.

An additional resource on homogeneous clipping: https://fabiensanglard.net/polygon_codec/index.php

Additional Topics

Rasterization Rules

Conventions that handle how geometry primitives should be rasterized.

Defined conventions can make sure cases like shared edges can be handled properly without gaps or overdraw.

A "fill convention" handles these cases with neighboring triangles. One such convention is called the "top left rule," where pixels are defined as "inside" a triangle if they are along the top edge of left edge.

Course video here

Optimizing Rasterization

https://kristoffer-dyrkorn.github.io/triangle-rasterizer/

Course video here

Subpixel Rasterization

Use floating points to represent sub-pixels. Bias towards the center of each pixel when calculating geometry (0.5, 0.5).

Fixed-point math becomes important here (vs floating points).

Course video here

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
assets		assets
images		images
src		src
subprojects		subprojects
vendor		vendor
videos		videos
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
meson.build		meson.build

Folders and files

Latest commit

History

Repository files navigation

3D Rendering in Software

Progress

01. Simple Perspective Cube Points

02. Simple Rotation Transformation

03. Wireframe Cube

04. Rendering OBJ Files

05. Back Face Culling

06. Triangle Rasterization

07. Face Depth Sorting

08. Transformation Matrices

09. Flat Shading with Global Lighting

10. Texture UV Mapping

11. Perspective Correct UV Interpolation

12. Loading Arbitrary Textures and Texture Maps

13. Z-Buffer

14. Simple Camera

15. Frustum Clipping

16. Camera Adjustments

17. Multiple Meshes

Topics To Review

Additional Learning Resources

Notes

Perspective Projection

Left vs Right-handed Coordinate Systems

Left Hand Rule

Right Hand Rule

Transformation

Rotation

Applying a rotation to a 2D vector using trigonometry functions

Vector Math Review

Vector Magnitude

Vector Addition

Vector Subtraction

Cross Product

Dot Product

Normalizing Vectors

Back Face Culling

Matrices

Matrix Operations

Matrix Addition

Matrix Subtraction

Matrix Multiplication

Identity Matrix

Simple Rotation Matrix

Transformation Matrices

Scale Matrix

Translation Matrix

Rotation Matrix

World Matrix

Order of Transformations

Perspective Projection Matrix

Normalizing Z

Deriving the Projection Matrix

Column-Major vs Row-Major

Flat Shading with Global Illumination

Texture Mapping

Interpolating with Barycentric Coordinates

Finding Barycentric Coordinates of a Point

Perspective Correct Texture Mapping

Z-Buffer

Camera

Clipping

Calculating the Frustum Planes

Right Frustum Plane

Left Frustum Plane

Top Frustum Plane

Bottom Frustum Plane

Near Frustum Plane

Far Frustum Plane

Determining Whether Points are Inside, Outside, or on a Plane

Determining Intersection of Lines with Planes

How to Clip a Polygon

Turning a Clipped Polygon Back Into Triangles

Clipping UV Coordinates

Homogeneous Clipping

Additional Topics

Packages