https://www.jmeiners.com/pre-rendered-backgrounds/

An Adventure in Pre-Rendered Backgrounds
By: Justin Meiners and Hunter Rasmussen

Pre-rendered backgrounds are a 3D graphics technique for creating higher-quality scenes with limited computational power. Instead of rendering a full 3D environment in real time, artists create still images of scenery offline. Interactive 3D objects, such as characters and items, are then drawn in-game on top of this static background. The tradeoff is that in-game objects can move around freely, but the background, and consequently the camera angle, must remain fixed for each scene. For the image to be coherent, the in-game camera parameters and perspective must closely match those of the pre-rendered image.

In the summer of 2017, Hunter and I (Justin) wanted to make a game using this technique. We thought it had a lot of charm and had been overlooked for a long time. Furthermore, with modern hardware, we could push it further than before by leveraging its original advantages in new ways. Imagine having the full creative potential of modern 3D tools to create scenes, and then dedicating the entire graphics budget to high-detail characters.

Since we still had jobs (and needless to say, a very limited amount of time), we set a goal to make as simple a project as possible, with just two characters, one level, and point-and-click game mechanics. We didn't get close to this and finished the summer with only one character and almost no gameplay. This is mostly due to how long it took us to create artwork, even at placeholder quality. But we learned a lot. This article is about our project (the source is available on GitHub).

1. History
2. Prototype
3. Engine
4. Perspective
5. Depth
6. Shadows
7. Reflections
8. Scripting
9. Backgrounds
10. Character
11. The Future
12. Additional References

History

Pre-rendered backgrounds became popular in the mid-1990s on the PlayStation 1 and, to a lesser extent, the Nintendo 64. Since 3D rendering was new and under-powered, many developers felt limited in the kinds of experiences they could create. In particular, creating cinematic experiences with compelling characters was out of the question. Pre-rendered backgrounds allowed artists to add a lot more detail, particularly in lighting and materials. They could take hours to render a single frame, using sophisticated rendering software usually reserved for films. This also freed the hardware to focus entirely on the characters and the few other objects in the scene.

It's important to note that this idea isn't entirely new. It's a computer analogue of traditional 2D animation techniques, such as drawing on cels. Creating detailed artwork is time consuming, and it's too much for artists to redraw an entire scene for every frame of a film (about 86,400 per hour at 24 frames per second). Instead, animators paint a detailed backdrop to be used for a longer window of time and only redraw the characters and moving objects on top. Even though the motivation is similar, film and games differ quite a bit in their methodology for applying this idea, as we will discuss later on.

Here are a few examples of classic video games that used the pre-rendered background technique.

Final Fantasy VII

The backgrounds of Final Fantasy VII are beautiful and diverse. They often include small animated pieces to avoid appearing too flat and static. You can admire a full gallery of backgrounds here. You can also learn more about the engine and technology here.
Slums (Final Fantasy VII)

Resident Evil

Resident Evil not only used this technique to create better graphics, it also embraced its limitations to complement the gameplay. For example, the fixed camera angles gave the designers full control of the player's view, allowing them to craft cinematic and scary scenes.

I wanted the game to present the player with something visible that could actually confront and threaten him. The player had to feel scared, as if something was waiting for him around the next corner. I believe the player could simultaneously feel fear and enjoy playing the game.
- Shinji Mikami

Resident Evil 1 mansion

Resident Evil 2

Later Resident Evil games changed to a free camera and full 3D environments, and subsequently transformed from horror games into action games.

The Legend of Zelda: Ocarina of Time

You might not remember Zelda using pre-rendered backgrounds. They are only used in a few scenes, such as inside Link's house and outside the Temple of Time. Unlike the other games, it is not clear to me why this technique was included at all. Perhaps it allowed them to create impressive-looking vistas, but this didn't seem to be a limitation outside of Ganon's castle. Another possibility is that it allowed the team to create these scenes faster.

Temple of Time (Ocarina of Time)

Link's House (Ocarina of Time)

Prototyping

With this idea for an engine, we thought about what kind of game would make sense. I really like the original Alien movie and thought similar themes would fit into a slower-paced exploration and adventure game. What if the entire game was just you and an alien stuck on a ship? As you try to stay alive, you solve puzzles and face the alien in various encounters. This movie concept art already looks like a possible scene in the game.

Prometheus concept art, by David Levy

The screenshots below show the final demo. The astronaut walks anywhere you click on the floor. He can also interact with objects, such as switches, by clicking on them (they are invisible for testing purposes). As the astronaut moves offscreen, the camera swaps to another view to follow him. The demo has only one scene: the character's sleeping quarters, with three small rooms in this area.

Even though it's a small amount of content, it represents a lot of design work and demonstrates that the project works. Whenever you design a new system or way of doing things, there are a million micro-questions that need to be answered to make all the parts fit together. Furthermore, they need to be answered in a way that creates a coherent work and follows a style.

The standard way to approach this design challenge for a video game is by creating a "vertical slice" of the game. This is a short playable demo that represents what 4-5 minutes of the game will look like, at the quality you expect to ship. Creating such a demo forces you to answer all the critical artistic and technical questions. For us, these questions included:

* how will the engine work?
* what is the process for making a scene?
* how big should the player be relative to objects?
* what camera angles look appropriate?
* how does in-game lighting match the pre-rendered lighting?

With a good vertical slice, the rest of the game can then be developed fairly easily. You know everything works. The content and methods can be scaled out "horizontally" by applying the lessons learned and following the established style over and over again.
Engine

I adapted an engine I had written for a previous game. It is written entirely in C and targets OpenGL ES 2.0 and OpenGL 3.2 hardware (macOS, iPhone, iPad, etc.). This generation of OpenGL APIs offers fully programmable vertex and fragment stages, but does not yet support geometry and tessellation shaders (similar to DirectX 9).

The primary function of the engine is to load, manage, and apply operations to 3D assets. It is largely flexible with regard to how the actual gameplay is programmed. Most of the asset formats are entirely custom and have custom export scripts for Blender. It also supports OBJ for prototyping static models. I am especially proud of the skeletal animation system, inspired by Doom 3's MD5 file format. The system is very simple, yet flexible, for attaching items to bones or writing parametrically driven animations. The skinning is also computed entirely on the GPU.

The engine is divided into systems, each with its own responsibilities:

* SndSystem
* ScriptSystem
* PartSystem
* GuiSystem
* NavSystem
* RenderSystem
* InputSystem

I tried to keep them decoupled, but it was not helpful to try to make them entirely independent. The engine calls into most of the systems, but, for example, GuiSystem may call some functions of InputSystem.

I like plain C quite a bit, especially for games. Along with the ability to tailor your program perfectly to the hardware, it forces you to solve concrete problems in the simplest possible way. When approaching a new feature, it is pretty clear what you need to do: you think about which structs are needed to represent the data and which functions operate on them, and you write them. In Boost-style C++, I find myself thinking more about the general case rather than the one I actually need to solve. But writing abstractions is the exception, not the rule. Don't get me wrong, I still love generic programming and think it can be very useful, but it isn't the best solution for every problem.

Abstraction and generalization are still possible in C, but they aren't the default. You create them as needed. For example, the OpenGL renderer and sound driver are entirely abstracted away from the rest of the code. Both are defined as structs of function pointers which are implemented for each desired API. This keeps the rest of the code insulated from platform-specific APIs, which makes it easier to reason about and port to other platforms.

Perspective

For pre-rendered backgrounds to function in the game, they must match both the visuals of the real-time objects and the gameplay in the scene. If the characters are drawn with a different camera angle or perspective than the background portrays, they won't look like they are actually there. The characters will appear to stand on top of a scene, rather than being inside it. If what the player sees on the screen doesn't match what is actually happening, then they will have a frustrating experience. In other words, the walls and barriers in the background should behave like walls and barriers to the player.

We can summarize this by saying: the more closely the pre-rendered backgrounds match the game, the more they will feel like a part of it.

With this principle in mind, we want to establish as many connections between the two as possible. In our game, we accomplished this by matching the coordinate systems in Blender and the game exactly. Both the pre-rendered background and the game level are edited together, keeping everything in sync.
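To make "matching the camera" concrete, here is a minimal sketch of the kind of conversion involved. The struct and function names are illustrative, not the engine's actual code, and it assumes the sensor height drives the vertical field of view. Blender describes a camera by focal length and sensor size, while a game typically builds its projection from a field of view, so the exporter has to derive one from the other and keep the near and far planes identical on both sides.

    #include <math.h>

    /* Illustrative sketch only: convert Blender-style camera settings
     * (focal length and sensor height in millimeters) into the vertical
     * field of view and an OpenGL projection matrix for the game. */
    typedef struct {
        float focal_length_mm;  /* e.g. 35.0f, as set on the Blender camera */
        float sensor_height_mm; /* e.g. 24.0f */
        float near_plane;       /* must match the planes used for the pre-render */
        float far_plane;
    } CameraParams;

    static float camera_fov_y(const CameraParams* cam)
    {
        /* Vertical field of view in radians. */
        return 2.0f * atanf(cam->sensor_height_mm / (2.0f * cam->focal_length_mm));
    }

    static void camera_projection(const CameraParams* cam, float aspect, float out[16])
    {
        /* Standard column-major OpenGL perspective matrix. */
        float f = 1.0f / tanf(0.5f * camera_fov_y(cam));
        float n = cam->near_plane;
        float fr = cam->far_plane;
        int i;

        for (i = 0; i < 16; ++i) out[i] = 0.0f;
        out[0]  = f / aspect;
        out[5]  = f;
        out[10] = (fr + n) / (n - fr);
        out[11] = -1.0f;
        out[14] = (2.0f * fr * n) / (n - fr);
    }

The exact formulas matter less than the discipline: whatever lens values Blender used to render the still are exported and fed into the in-game projection, rather than being set by eye.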
Blender camera view

In-game camera view

The scenes are modeled in Blender in full detail, with materials and lighting. After the scene is complete, we place cameras at viewpoints of interest, making sure that the cameras can see every area where the player can walk. Our thinking here is that we want the scenes to look good from all angles. This allows us to adjust camera angles freely. It also removes the temptation to create a "false view" for one camera which isn't accurately depicted by the others. This strategy could be extended to the whole game. Instead of creating the game scenery in pieces, you could model the entire ship coherently, and then add in cameras and game objects.

A navigation mesh is modeled just above the floor, representing the playable or walkable area. Unlike the scenery, this navigation mesh exists in the actual game and is used to move the characters. Each edge of the navigation mesh can be flagged with attributes, such as whether the edge is a wall or just an area the player cannot walk into (like a hole).

Scene in Blender

When a scene is exported from Blender, the coordinates of all the cameras and game objects are recorded along with the navigation mesh. These are loaded into the actual game simulation. As the player walks around the navigation mesh and leaves the view of one camera, the in-game camera swaps to another, and the background swaps with it. The game simulation stays continuous during camera changes. During major scenery changes, such as loading an entirely new area, the game may not be continuous, but this is not shown in the demo.

In the screenshot below, the only things that exist in the actual game simulation are the wireframes, the player, and the wrench. The large gray boxes are triggers which swap the camera when the player walks into them.

Hidden objects

With all the components together, a scene is composed of:

1. Coordinates of game objects, lights, and cameras.
2. Lens settings for each camera, such as field of view and near/far planes.
3. A color and depth image for each camera.
4. A cubemap for each reflection probe.
5. Navigation mesh.
6. Level script.

I will discuss each of these in more detail.

Depth

Earlier, I gave the impression that the characters were drawn on top of the backgrounds, as if they were backdrops, like in old Disney movies. Notice that in this scene from Snow White, all the characters are just sitting on the background, with only a few hints as to their depth, such as shadows and relative scale. In the top left, one bird is covered by the foreground.

Snow White

Unlike movies, games don't have control over all the objects in the scene. In a game, the depth of objects in a scene is dynamic. What happens if the player walks behind an object? I initially wondered if this could be resolved by dividing the pre-rendered images into two layers, foreground and background, as shown in the movie. The foreground would always be drawn last, on top of the interactive objects. I did a test render to try it out. (Yes, the artwork is awful!)

Layer prototype: background

Layer prototype: foreground

Layer prototype: combined

Although this works well for the foreground structure, you can see right away that this comes with a lot of restrictions. What about the columns and benches? Can't the player walk in front of and behind those?
Clearly, a column cannot be safely assigned to either layer unless you force the player to walk only on one side of it. One could design a game around this layering system, but it would require a lot of tedious planning to make sure that you never had objects that could be walked both in front of and behind.

Final Fantasy VII actually does something similar, as explained in the analysis I mentioned earlier. The background is cut up into tiles, and each of these is assigned a draw order. These appear to adjust dynamically, so the player can move in front of and behind certain objects.

Final Fantasy VII background & tiles

I wanted something more robust and flexible, so I had the idea to pre-render a depth buffer along with each background. Provided that the cameras in Blender and in the game use the same near and far planes, the pre-rendered depth buffer is equivalent to a depth buffer produced by rendering the same scene in real time.

Depth buffer layer prototype

Coercing Blender to render a depth buffer was tricky, since its raytracing renderer doesn't normally use one. The best I could do was produce a linear depth buffer. However, OpenGL does not use a linear depth buffer. It gives higher precision to objects close to the camera and less precision to those far away (just like floating point does with small and large values). This makes a lot of sense, since you don't care as much about minuscule differences in depth at long distances.

The following snippet shows how the fragment shader renders both the color and depth values. The linear depth values are rescaled into OpenGL's non-linear range.

    vec4 depth = texture(u_depth, v_uv);

    float near = 0.5;
    float far = 25.0;

    // Convert the pre-rendered linear depth into OpenGL's non-linear depth.
    float z_linear = depth.r;
    float z_non_linear = -((near + far) * z_linear - (2.0 * near)) / ((near - far) * z_linear);
    gl_FragDepth = z_non_linear;

    // Copy background color
    vec4 color = texture(u_albedo, v_uv);
    fragColor = color;

I tried to create a post-process filter which would encode the non-linear depth buffer in Blender. The shader could then copy it directly into the depth buffer without transforming from linear, but for some reason I couldn't get this precise enough to be satisfied with.

Some graphics hardware doesn't support writing to the depth buffer in a shader. Even OpenGL ES 2.0 requires an extension. You may be able to accomplish something similar with glColorMask and a fullscreen quad.

Shadows

Shadows in real-time graphics can be tricky. The dominant technique is shadow mapping, followed by stencil shadow volumes as used in Doom 3. For this project, I had to consider which shadow rendering method could combine with pre-rendered backgrounds. I had no insight into how this could work with shadow volumes. With shadow mapping, one may be able to pre-render the shadow map of the detailed scenery from the perspective of each light. But these depth maps must also be combined with the real-time objects, in a similar way to the backgrounds. Since they could be rendered in high resolution and detail, this may provide a boost to shadow quality that is not possible in real time. (As far as I know, this could not have been done in the old games, and I encourage anyone interested in this idea to explore it.)

I have been through the pains of writing shadow mapping and volumes in the past, so for this project I wanted to use simple planar projection instead. This involves flattening the model onto a plane with a transformation matrix and then filling it in with a dark color.
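As a sketch of the idea (this is not the engine's actual code; the plane, light values, and function names are placeholders), the classic planar shadow trick builds a 4x4 matrix that projects every vertex along the light direction onto a plane, and the model is then drawn a second time through that matrix in a flat dark color.

    /* Hypothetical sketch of planar projected shadows (not the engine's code).
     * Builds the classic "shadow matrix" that flattens geometry onto the plane
     * a*x + b*y + c*z + d = 0 as seen from a light l, where l[3] = 1 for a
     * point light and 0 for a directional light. The plane and light must be
     * expressed in the same space (here, world space). */
    static void shadow_matrix(const float plane[4], const float light[4], float out[16])
    {
        float dot = plane[0] * light[0] + plane[1] * light[1]
                  + plane[2] * light[2] + plane[3] * light[3];
        int row, col;

        /* M = dot(plane, light) * I - outer(light, plane), stored column-major. */
        for (col = 0; col < 4; ++col) {
            for (row = 0; row < 4; ++row) {
                out[col * 4 + row] = -light[row] * plane[col];
                if (row == col)
                    out[col * 4 + row] += dot;
            }
        }
    }

    /* Example: floor at y = 0, with a key light above and in front of the scene. */
    void example_shadow_pass(void)
    {
        float floor_plane[4] = { 0.0f, 1.0f, 0.0f, 0.0f };
        float key_light[4]   = { 2.0f, 6.0f, 1.0f, 1.0f };
        float shadow[16];

        shadow_matrix(floor_plane, key_light, shadow);
        /* Redraw the character with view * shadow * model, a flat dark color,
         * and alpha blending enabled. */
    }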
Planar projection works well for showing a simple shadow underneath a character, but has a lot of limitations. The most glaring issue is that shadows cannot be cast on other scene objects. If you walk in front of a wall or obstacle, the shadow will leak through to the other side. At least these limitations are no worse with pre-rendered backgrounds than in a real-time scene.

A simple improvement I made was to have the character cast two shadows instead of one. In each scene, one light is designated as the primary source and another as the secondary. The primary light casts a dark shadow while the secondary light casts a lighter one. This avoids some of the harsh look that is typical of planar projection and makes the shadows appear more lively.

Primary & secondary shadows

Reflections

If you look closely at the astronaut's helmet and vest, you will notice that they reflect the surrounding scene. Following the principle of connecting the background to the game, I thought this was an essential effect for grounding the character in the scene. It also helps give the material of the visor an authentic look.

The reflections are done by rendering a cubemap that accompanies each background. I wrote a script for Blender which orients the camera in all six directions and captures an image for each face. The reflective surfaces sample the cubemap to obtain reflections.

Cubemap reflection

The cubemaps don't need to be very high resolution to give a nice-looking effect. Even 32x32 faces can make a big difference, and the performance cost in the shader is almost negligible.

Here is the reflection code, which is fairly standard. The camera direction is reflected about the surface normal, and the resulting vector is used to look up a color in the cubemap.

    vec3 viewDir = normalize(v_fragPosition - u_camPosition);
    vec3 envVec = reflect(viewDir, normal);
    vec3 envColor = texture(u_envMap, vec3(envVec.x, -envVec.y, envVec.z)).rgb;

Scripting

Since the game was supposed to have a lot of interactive point-and-click objects, I thought scripting would be essential. The script system was inspired by TSC, a language invented for Cave Story. I liked TSC because it is very simple and also very effective for scripting game events. Cave Story also has a lot of dialogue and scripted scenes which are similar in structure and level of complexity to what I was going for.

Cave Story screenshot

The language is structured like assembly but is interpreted directly from plain text. It doesn't have a fancy runtime stack and mainly uses branch operations and event labels. Unlike a typical interpreted language, the script doesn't continue immediately after executing an instruction, so it cannot piggyback on top of the C stack. For example, the instruction for playing a sound waits until the sound has completed before moving on to the next instruction. Here is a sample:

#0030