4. Screen‐Space Classification for Efficient Deferred Shading (1/4)

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

 55

Screen‐SpaceClassificationfor

EfficientDeferredShading

Balor Knight

Matthew Ritchie

George Parrish

Black Rock Studio

4.1Introduction

Deferred shading is an increasingly popular technique used in video game ren-

dering. Geometry components such as depth, normal, material color, etc., are

rendered into a geometry buffer (commonly referred to as a G-buffer), and then

deferred passes are applied in screen space using the G-buffer components as

inputs.

A particularly common and beneficial use of deferred shading is for faster

lighting. By detaching lighting from scene rendering, lights no longer affect sce-

ne complexity, shader complexity, batch count, etc. Another significant benefit of

deferred lighting is that only relevant and visible pixels are lit by each light, lead-

ing to less pixel overdraw and better performance.

The traditional deferred lighting model usually includes a fullscreen lighting

pass where global light properties, such as sun light and sun shadows, are ap-

plied. However, this lighting pass can be very expensive due to the number of

onscreen pixels and the complexity of the lighting shader required.

A more efficient approach would be to take different shader paths for differ-

ent parts of the scene according to which lighting calculations are actually re-

quired. A good example is the expensive filtering techniques needed for soft

shadow edges. It would improve performance significantly if we only performed

this filter on the areas of the screen that we know are at the edges of shadows.

56 4.Screen‐SpaceClassificationforEfficientDeferredShading

This can be done using dynamic shader branches, but that can lead to poor per-

formance on current game console hardware.

Swoboda [2009] describes a technique that uses the PlayStation 3 SPUs to

analyze the depth buffer and classify screen areas for improved performance in

post-processing effects, such as depth of field. Moore and Jefferies [2009] de-

scribe a technique that uses low-resolution screen-space shadow masks to classi-

fy screen areas as in shadow, not in shadow, or on the shadow edge for improved

soft shadow rendering performance. They also describe a fast multisample anti-

aliasing (MSAA) edge detection technique that improves deferred lighting per-

formance.

These works provided the background and inspiration for this chapter, which

extends things further by classifying screen areas according to the global light

properties they require, thus minimizing shader complexity for each area. This

work has been successfully implemented with good results in Split/Second, a rac-

ing game developed by Disney’s Black Rock Studio. It is this implementation

that we cover in this chapter because it gives a practical real-world example of

how this technique can be applied.

4.2OverviewofMethod

The screen is divided into



pixel tiles. For every frame, each tile is classified

according to the minimum global light properties it requires. The seven global

light properties used on Split/Second are the following:

1. Sky. These are the fastest pixels because they don’t require any lighting cal-

culations at all. The sky color is simply copied directly from the G-buffer.

2. Sun light. Pixels facing the sun require sun and specular lighting calculations

(unless they’re fully in shadow).

3. Solid shadow. Pixels fully in shadow don’t require any shadow or sun light

calculations.

4. Soft shadow. Pixels at the edge of shadows require expensive eight-tap per-

centage closer filtering (PCF) unless they face away from the sun.

5. Shadow fade. Pixels near the end of the dynamic shadow draw distance fade

from full shadow to no shadow to avoid pops as geometry moves out of the

shadow range.

6. Light scattering. All but the nearest pixels have a light scattering calculation

applied.

7. Antialiasing. Pixels at the edges of polygons require lighting calculations for

both 2X MSAA fragments.

4.2

verviewof

stor

excl

whe

inde

sha

mea

sha

exis

the

add

scre

erti

gree

ethod

e calculate

the result i

sive for a si

properties

nce we’ve

x buffer for

er with the

e found th

utation tim

ller tiles me

t more lig

ers. A size

ing sc

een-s

lassification

up to 57,6

nshots from

s highlighte

re 4.1. A scre

which light

a 7-bit class

ngle pixel, s

re combined

generated a

ach ID that

inimum ligh

t a

44

til

and shader

nt spending

ting propert

f 44 pixe

ace shadow

code, as ex

0 tiles at a

the Split/Se

enshot from

roperties ar

ification ID.

ch as sky a

into

44

lassificatio

oin

s to the

ing code req

size gave t

complexity,

oo much ti

es affec

ing

s also conve

mask [Moor

lained later.

esolution of

ond tutorial

lit/Second w

required fo

Some of the

d sunlight, b

el tiles.

ID for eve

iles with tha

ired for tho

e best balan

leading to

e classifyin

each tile, l

niently matc

and Jefferi

or Split/Sec

1280 720 .

ode with d

th soft shado

each

44

e properties

ut they can e

y tile, we th

t ID and ren

e light prop

ce between

est overall

the tiles, an

ading to m

es the reso

s 2009], whi

ond, the use

igures 4.1 a

fferent glob

edge pixels

ixel tile and

are mutually

xist

ogether

en create an

er it using a

rties.

lassification

erformance.

larger tiles

re complex

ution of our

h simplifies

44

tiles

d 4.2 show

l light prop-

ighlighted in

57

58

Figure 4

Depth

‐

Tile cla

the seve

we clas

screen-s

(320 1



and it i

shader

mask co

screen-s

pixels i

work re

not in s

this text

areas ex

For

shadow

reading

jections.

2. A screensh

Related

sification in

light prop

ify the othe

ace shado

0) texture,

also readin

omplexity i

e to perfor

re and Jeffer

ace shado

shadow, pi

ults in a tex

adow, and a

re for each

ept those ne

ile classifica

fade since t

epth in thes

4.Scre

t from Split/S

lassific

Split/Secon

rties during

three in a

code is

hich perfec

depths, me

the pe

-pix

all depth-re

ies [2009] e

mask textu

els not in s

ure containi

l other valu

screen-space

r the edges

ion, we exte

ey’re both c

e shaders to

n‐SpaceCla

cond with M

tion

is broken i

ur screen-sp

-pixel pas

lready gen

ly matches

ning that w

l pass by e

ated classifi

plain how w

e that conta

adow, and

g zeros for

s for pixels

position, w

f shadows t

d this code

lculated fro

econstruct

sificationfo

AA edge pix

to two parts

ace shadow

s. The reaso

rating a o

ur tile resol

can minim

tending the

ation.

generate a

ns three sha

ixels near t

pixels in sh

ear a shado

can avoid

at we want t

o also classi

depth alo

orld positio

EfficientDe

ls highlighted

. We classif

ask genera

n for this is

e-quarter r

tion of



ze texture r

screen-space

ne-quarter r

dow types p

e shadow e

dow, ones f

edge. By l

xpensive PC

be soft.

y light scatt

e, and we’r

for the sha

erredShadi

in green.

four of

ion, and

that the

solution

pixels,

ads and

shadow

solution

r pixel:

ge. This

r pixels

oking at

F for all

ring and

already

ow p

g

4.3Depth‐RelatedClassification 59

float shadowType = CalcShadowType(worldPos, depth);

float lightScattering = (depth > scatteringStartDist) ? 1.0 : 0.0;

float shadowFade = (depth > shadowFadeStartDist) ? 1.0 : 0.0;

output.color = float4(shadowType, lightScattering, shadowFade, 0.0);

Listing 4.1. Classifying light scattering and shadow fade in the first-pass shadow mask shader.

Recall that the shadow mask is generated in two passes. The first pass calcu-

lates the shadow type per pixel at one-half resolution (

640 36



) and the second

pass conservatively expands the pixels marked as near shadow edge by down-

sampling to one-quarter resolution. Listing 4.1 shows how we add a simple light

scattering and shadow fade classification test to the first-pass shader.

Listing 4.2 shows how we extend the second expand pass to pack the classi-

fication results together into four bits so they can easily be combined with the

per-pixel classification results later on.

// Read 4 texels from 1st pass with sample offsets of 1 texel.

#define OFFSET_X (1.0 / 640.0)

#define OFFSET_Y (1.0 / 360.0)

float3 rgb = tex2D(tex, uv + float2(-OFFSET_X, -OFFSET_Y)).rgb;

rgb += tex2D(tex, uv + float2(OFFSET_X, -OFFSET_Y)).rgb;

rgb += tex2D(tex, uv + float2(-OFFSET_X, OFFSET_Y)).rgb;

rgb += tex2D(tex, uv + float2(OFFSET_X, OFFSET_Y)).rgb;

// Pack classification bits together.

#define RAW_SHADOW_SOLID (1 << 0)

#define RAW_SHADOW_SOFT (1 << 1)

#define RAW_SHADOW_FADE (1 << 2)

#define RAW_LIGHT_SCATTERING (1 << 3)

float bits = 0.0;

if (rgb.r == 0.0)

bits += RAW_SHADOW_SOLID / 255.0;

else if (rgb.r < 4.0)

bits += RAW_SHADOW_SOFT / 255.0;

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 4. Screen‐Space Classification for Efficient Deferred Shading (1/4)

Create new playlist

Sign In

Sign Up

Table of Contents for
4. Screen‐Space Classification for Efficient Deferred Shading (1/4)