// "out" is a reserved word in HLSL, so the blended result uses a different name.
float4 result = freshData;
result.a = ActiveFrame.w;
return (result);
}
Listing 7.2. Simplified reprojection cache.
Bilateral Filtering
Bilateral filtering is conceptually similar to bilateral upsampling. We perform
Gaussian filtering with weights influenced by a geometric similarity function
[Tomasi and Manduchi 1998]. We can treat it as an edge-aware smoothing filter
or a high-order reconstruction filter utilizing spatial coherence. Bilateral filtering
proves to be extremely efficient for content-aware data smoothing. Moreover,
with only insignificant artifacts, a bilateral filter can be separated into two directions, leading to O(n) running time. We use it for any kind of slowly-varying
data, such as ambient occlusion or shadows, that needs to be aware of scene ge-
ometry. Moreover, we use it to compensate for undersampled pixels. When a
pixel lacks samples, lacks history data, or has missed the cache, it is reconstruct-
ed from spatial coherency data. That solution leads to more plausible results
compared to relying on temporal data only. Listing 7.3 shows a separable, depth-
aware bilateral filter that uses hardware linear filtering.
float Bilateral3D5x5(sampler2D inSampler, float2 texelSize,
float2 UV, float2 Dir)
{
const float centerWeight = 0.402619947;
const float4 tapOffsets = float4(-3.5, -1.5, 1.5, 3.5);
const float4 tapWeights = float4(0.054488685, 0.244201342,
0.244201342, 0.054488685);
const float E = 1.0;                          // avoids division by zero
const float diffAmp = IN_BilateralFilterAmp;  // depth-difference sensitivity
float2 color;       // x = filtered data, y = center depth
float4 diffIp;      // depth-aware weights for the four taps
float4 pTaps[2];    // tap samples: .x/.z = data, .y/.w = depth
float2 offSize = Dir * texelSize;
pTaps[0] = UV.xyxy + tapOffsets.xxyy * offSize.xyxy;
pTaps[1] = UV.xyxy + tapOffsets.zzww * offSize.xyxy;
color = tex2D(inSampler, UV.xy).ra;
// r – contains data to be filtered
// a – geometry depth
pTaps[0].xy = tex2D(inSampler, pTaps[0].xy).ra;
pTaps[0].zw = tex2D(inSampler, pTaps[0].zw).ra;
pTaps[1].xy = tex2D(inSampler, pTaps[1].xy).ra;
pTaps[1].zw = tex2D(inSampler, pTaps[1].zw).ra;
float4 centralD = color.y;
diffIp = (1.0 / (E + diffAmp * abs(centralD - float4(pTaps[0].y,
pTaps[0].w, pTaps[1].y, pTaps[1].w)))) * tapWeights;
float Wp = 1.0 / (dot(diffIp, 1) + centerWeight);
color.r *= centerWeight;
color.r = Wp * (dot(diffIp, float4(pTaps[0].x, pTaps[0].z,
pTaps[1].x, pTaps[1].z)) + color.r);
return (color.r);
}
Listing 7.3. Directional bilateral filter working with depth data.
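Because the filter is directional, it is meant to be run twice, once per axis, with the first pass writing to an intermediate target that the second pass reads. The sketch below is only a plausible usage example; the helper names and pass setup are assumptions rather than the engine's actual interface.

float4 BilateralHorizontal(sampler2D inSampler, float2 texelSize, float2 UV)
{
    // First pass: filter along x and carry depth through in the alpha
    // channel so that the second pass stays depth-aware.
    float filtered = Bilateral3D5x5(inSampler, texelSize, UV, float2(1.0, 0.0));
    return float4(filtered, 0.0, 0.0, tex2D(inSampler, UV).a);
}

float4 BilateralVertical(sampler2D intermediate, float2 texelSize, float2 UV)
{
    // Second pass: reads the intermediate target written by the first pass.
    float filtered = Bilateral3D5x5(intermediate, texelSize, UV, float2(0.0, 1.0));
    return float4(filtered, 0.0, 0.0, tex2D(intermediate, UV).a);
}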
Spatiotemporal Coherency
We would like to combine the described techniques to take advantage of the spa-
tiotemporal coherency in the data. Our default framework works in several steps:
1. Depending on the data, caching is performed at lower resolution.
2. We operate with the history buffer (HB) and the current buffer (CB).
3. The CB is computed with a small set of current samples.
4. Samples from the HB are accumulated in the CB by means of reprojection
caching.
5. A per-pixel convergence factor is saved for further processing.
6. The CB is bilaterally filtered with a higher smoothing rate for pixels with a
lower convergence rate to compensate for smaller numbers of samples or
cache misses.
7. The CB is bilaterally upsampled to the original resolution for further use.
8. The CB is swapped with the HB.
The buffer format and processing steps differ among specific applications; a minimal sketch of how steps 4-6 can be tied together follows.
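The sketch below is illustrative only. It assumes a cache layout with the accumulated value in .r, geometry depth in .a (as in Listing 7.3), and the per-pixel convergence factor in .b; any real application would adapt the layout and blending to its own data.

float ResolveWithConvergence(sampler2D CacheBuffer, float2 texelSize,
                             float2 UV, float2 Dir)
{
    // Assumed layout: .r = accumulated data, .b = convergence, .a = depth.
    float4 cached = tex2D(CacheBuffer, UV);
    float convergence = cached.b;

    // Spatial reconstruction from the neighborhood (Listing 7.3).
    float filtered = Bilateral3D5x5(CacheBuffer, texelSize, UV, Dir);

    // Step 6: pixels with a low convergence rate fall back to the spatially
    // reconstructed value; converged pixels keep their accumulated history.
    return lerp(filtered, cached.r, convergence);
}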
7.3 Applications
Our engine is composed of several complex pixel-processing stages that include
screen-space ambient occlusion, screen-space soft shadows, subsurface scattering
for skin shading, volumetric effects, and a post-processing pipeline with depth of
field and motion blur. We use the spatiotemporal framework to accelerate most
of those stages in order to get the engine running at production-quality speeds on
current-generation consoles.
Screen-Space Ambient Occlusion
Ambient occlusion (AO) is computed by integrating the visibility function over a hemisphere $H$ with respect to the projected solid angle, as follows:

$$A(\mathbf{p}) = \frac{1}{\pi} \int_{H} V(\mathbf{p}, \omega)\, (\mathbf{N} \cdot \omega)\, d\omega,$$

where $\mathbf{N}$ is the surface normal and $V(\mathbf{p}, \omega)$ is the visibility function at $\mathbf{p}$ (such that $V(\mathbf{p}, \omega) = 0$ when the point is occluded in the direction $\omega$, and $V(\mathbf{p}, \omega) = 1$ otherwise). It can be efficiently computed in screen space by multiple occlusion checks that sample the
depth buffer around the point being shaded. However, it is extremely taxing on
the GPU due to the high sample count and large kernels that thrash the texture
cache. On current-generation consoles, it seems impractical to use more than
eight samples. In our case, we could not even afford that many because, at the
time, we had only two milliseconds left in our frame time budget.
After applying the spatiotemporal framework, we could get away with only
four samples per frame, and we achieved even higher quality than before due to
amortization over time. We computed the SSAO at half resolution and used bi-
lateral upsampling during the final frame combination pass. For each frame, we
changed the SSAO kernel sampling pattern, and we took care to generate a uni-
formly distributed pattern in order to minimize frame-to-frame inconsistencies.
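As one possible illustration of such a per-frame pattern variation (the helper below and its constants are hypothetical, not the kernel we shipped), a small uniformly distributed tap set can be rotated by a frame-dependent angle so that the four samples taken each frame amortize into a denser distribution over time.

float2 RotatedKernelTap(int tapIndex, int frameIndex)
{
    // Four base directions, uniformly distributed over the unit circle.
    static const float2 baseTaps[4] = {
        float2( 1.0,  0.0), float2( 0.0,  1.0),
        float2(-1.0,  0.0), float2( 0.0, -1.0)
    };

    // 16 distinct rotations before the pattern repeats (an assumed period).
    float angle = (frameIndex % 16) * (3.14159265 / 32.0);
    float s, c;
    sincos(angle, s, c);

    // Rotate the selected base tap by the per-frame angle.
    float2 tap = baseTaps[tapIndex];
    return float2(tap.x * c - tap.y * s, tap.x * s + tap.y * c);
}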
Due to memory constraints on consoles, we decided to rely only on depth information, leaving the surface normal vectors available only for SSAO computation. Furthermore, since we used only camera-based motion blur, we lacked per-pixel motion vectors, so an additional pass for motion field computation was out of the question. During caching, we resorted to camera reprojection only. Our cache-miss detection algorithm compensated for that by calculating a running convergence based on the distance between a history sample and the predicted valid position. That policy tended to give good results, especially considering the additional processing steps involved. After reprojection, ambient occlusion data was bilaterally filtered, taking convergence into consideration when available (PC only). Pixels with high temporal confidence retained high-frequency details, while others were reconstructed spatially depending on the convergence factor. It is worth noticing that we were switching history buffers after bilateral filtering. Therefore, we were filtering over time, which enables us to use small kernels without significant quality loss. The complete solution required only one millisecond of GPU time and enabled us to use SSAO in real time on the Xbox 360. Figure 7.7 shows final results compared to the default algorithm.

Figure 7.7. The left column shows SSAO without using spatial coherency. The right column shows our final Xbox 360 implementation.
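The distance-based cache-miss handling described above can be sketched as follows; the threshold, step constant, and buffer layout are assumptions made for illustration rather than the engine's exact values.

float UpdateConvergence(float historyDepth, float currentDepth,
                        float previousConvergence)
{
    // Relative depth difference between the cached sample and the position
    // predicted by camera reprojection.
    float diff = abs(historyDepth - currentDepth) / max(currentDepth, 1e-4);

    // IN_CacheMissThreshold and IN_ConvergenceStep are assumed tuning
    // constants: large differences count as a cache miss and restart
    // accumulation, otherwise the running convergence grows toward 1.
    float miss = step(IN_CacheMissThreshold, diff);
    return lerp(saturate(previousConvergence + IN_ConvergenceStep), 0.0, miss);
}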
Soft Shadows

Our shadowing solution works in a deferred manner. We use the spatiotemporal framework for sun shadows only since those are computationally expensive and visible all the time. First, we draw sun shadows to an offscreen low-resolution buffer. While shadow testing against a cascaded shadow map, we use a custom percentage-closer filter. For each frame, we use a different sample from a well-distributed sample set in order to leverage temporal coherence [Scherzer et al. 2007]. Reprojection caching accumulates the samples over time in a manner similar to our SSAO solution. Then the shadow buffer is bilaterally filtered in screen space and bilaterally upsampled for the final composition pass. Figures 7.8 and 7.9 show our final results for the Xbox 360 implementation.

Figure 7.8. Leveraging the spatiotemporal coherency of shadows (bottom) enables a soft, filtered look, free of undersampling artifacts, without raising the shadow map resolution of the original scene (top).