112    7. A Spatial and Temporal Coherence Framework for Real Time Graphics

Figure 7.9. The spatiotemporal framework efficiently handles shadow acne and other
flickering artifacts (right) that appear in the original scene (left).

Shadows and Ambient Occlusion Combined

Since our shadowing pipeline is similar to the one used during screen-space ambient
occlusion, we integrate both solutions in our most efficient implementation running
on consoles. Our history buffer is half the resolution of the back buffer, and it is
stored in RG16F format. The green channel stores the minimum depth of the four
underlying pixels in the Z-buffer. The red channel contains both the shadowing and
ambient occlusion information. The integer part of the 16-bit floating-point value is
used for occlusion because it requires more variety, and the fractional part holds the
shadowing factor. Functions for packing and unpacking these values are shown in
Listing 7.4. Every step of the spatiotemporal framework runs in parallel on both the
shadowing and ambient occlusion values using the packing and unpacking functions.
The last step of the framework is bilateral upsampling combined with the main
deferred shading pass. Figure 7.10 shows an overview of the pipeline. The
performance gained by using our technique on the Xbox 360 is shown in Table 7.1.
#define PACK_RANGE 31.0
#define MIN_FLT 0.01

float PackOccShadow(float Occ, float Shadow)
{
    return (floor(saturate(Occ) * PACK_RANGE) +
            clamp(Shadow, MIN_FLT, 1.0 - MIN_FLT));
}

float2 UnpackOccShadow(float OccShadow)
{
    return float2(floor(OccShadow) / PACK_RANGE, frac(OccShadow));
}
Listing 7.4. Code for shadow and occlusion data packing and unpacking.
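As a quick sanity check of the packing scheme (not part of the original text), the listing's HLSL functions can be mirrored in plain Python, with `saturate` and `frac` emulated via `math`. The round trip confirms that occlusion survives as one of 32 discrete levels in the integer part while the clamped shadow factor keeps the fractional precision:

```python
import math

PACK_RANGE = 31.0
MIN_FLT = 0.01

def saturate(x):
    # HLSL saturate(): clamp to [0, 1].
    return min(max(x, 0.0), 1.0)

def pack_occ_shadow(occ, shadow):
    # Integer part: occlusion quantized to 0..31 levels;
    # fractional part: shadow factor clamped away from 0 and 1.
    return math.floor(saturate(occ) * PACK_RANGE) + \
           min(max(shadow, MIN_FLT), 1.0 - MIN_FLT)

def unpack_occ_shadow(packed):
    # floor() recovers the occlusion level, the fraction the shadow factor.
    return math.floor(packed) / PACK_RANGE, packed - math.floor(packed)

occ, shadow = unpack_occ_shadow(pack_occ_shadow(0.5, 0.25))
# occ is quantized to 1/31 steps; shadow round-trips exactly in this case.
```

Note that a real RG16F target adds half-precision rounding on top of this quantization, which the Python sketch does not model.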
Figure 7.10. Schematic diagram of our spatiotemporal framework used with SSAO and
shadows. [Diagram: the shadow map and depth buffer feed shadow-buffer generation
and a min-depth rewrite; the depth buffer, volumetric noise, and normal buffer feed
SSAO generation; the shadow and AO terms are packed (shadow + AO, min depth)
and passed through reprojection caching against the history buffer, separable bilateral
filtering, and bilateral upsampling during the main deferred shading pass.]
Stage                  ST Framework                Reference
Shadows                0.7 ms                      3.9 ms
SSAO generation        1.1 ms                      3.8 ms
Reprojection caching   0.35 ms                     --
Bilateral filtering    0.42 ms (0.2 ms per pass)   --
Bilateral upsampling   0.7 ms                      0.7 ms
Total                  3.27 ms                     8.4 ms

Table 7.1. Performance comparison of various stages and a reference solution in which
shadowing is performed at full resolution with 2x2 jittered PCF, and SSAO uses 12 taps
and upsampling. The spatiotemporal (ST) framework is 2.5 times faster than the
reference solution and still yields better image quality.
Postprocessing
Several postprocessing effects, such as depth of field and motion blur, tend to
have high spatial and temporal coherency. Both can be expressed as a multisam-
pling problem in time and space and are, therefore, perfectly suited for our
framework. Moreover, the mixed frequency nature of both effects tends to hide
any possible artifacts. During our tests, we were able to perform production-
ready postprocessing twice as fast as with a normal non-cached approach.
Additionally, blurring is an excellent candidate for use with the spatiotem-
poral framework. Normally, when dealing with extremely large blur kernels, hi-
erarchical downsampling with filtering must be used in order to reach reasonable
performance with enough stability in high-frequency detail. Using importance
sampling for downsampling and blurring with the spatiotemporal framework, we
are able to perform high-quality Gaussian blur, using radii reaching 128 pixels in
a 720p frame, with no significant performance penalty (less than 0.2 ms on the
Xbox 360). The final quality is shown in Figure 7.11.
First, we sample nine points with linear filtering and importance sampling in
a single downscaling pass to 1/64 of the screen size. Stability is sustained by the
reprojection caching, with different subsets of samples used during each frame.
The resulting image is blurred, cached, and upsampled. Bilateral filtering is used
when needed by the application (e.g., for depth-of-field simulation where geome-
try awareness is required).
Figure 7.11. The bottom image shows the result of applying a large-kernel (128-pixel)
Gaussian blur used for volumetric water effects to the scene shown in the top image.
This process is efficient and stable on the Xbox 360 using the spatiotemporal
framework.

7.4 Future Work

There are several interesting concepts that use the spatiotemporal coherency, and we
performed experiments that produced surprisingly good results. However, due to
project deadlines, additional memory requirements, and lack of testing, those
concepts were not implemented in the final iteration of the engine. We would like to
present our findings here and improve upon them in the future.

Antialiasing

The spatiotemporal framework is also easily extended to full-scene antialiasing
(FSAA) at a reasonable performance and memory cost [Yang et al. 2009]. With
deferred renderers, we normally have to render the G-buffer and perform lighting
computation at a higher resolution. In general, FSAA buffers tend to be twice as big
as the original frame buffer in both the horizontal and vertical directions. When
enough processing power and memory are available, higher-resolution antialiasing
schemes are preferred.
The last stage of antialiasing is the downsampling process, which generates
stable, artifact-free, edge-smoothed images. Each pixel of the final frame buffer
is an average of its subsamples in the FSAA buffer. Therefore, we can easily re-
construct the valid value by looking back in time for subsamples. In our experi-
ment, we wanted to achieve 4X FSAA. We rendered each frame with a subpixel
offset, which can be achieved by manipulating the projection matrix. We as-
sumed that four consecutive frames hold the different subsamples that would
normally be available in 4X FSAA, and we used reprojection to integrate those
subsamples over time. When a sample was not valid due to disocclusion, we re-
jected it. When misses occurred, we could also perform bilateral filtering with
valid samples to leverage spatial coherency.
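For a static scene, the scheme above amounts to a running average of four jittered renderings. A minimal NumPy sketch (with a hypothetical `render` function standing in for a frame drawn with a subpixel projection offset, an illustrative 2x2 offset pattern, and no rejection or bilateral fallback) shows that the temporally integrated history matches direct 4X supersampling:

```python
import numpy as np

# Four subpixel offsets of a 2x2 grid pattern (an illustrative choice;
# the text only says the projection matrix is jittered each frame).
OFFSETS = [(0.25, 0.25), (0.75, 0.25), (0.25, 0.75), (0.75, 0.75)]

def render(offset, size=32):
    # Hypothetical renderer: samples a smooth analytic "scene" at one
    # subpixel position per pixel, mimicking a jittered projection.
    ox, oy = offset
    y, x = np.mgrid[0:size, 0:size]
    return np.sin(0.3 * (x + ox)) * np.cos(0.2 * (y + oy))

# Integrate four consecutive jittered frames with a running mean. For a
# static scene (no disocclusion, identity reprojection) the history
# buffer converges to the true 4X supersampled result.
history = None
for n, o in enumerate(OFFSETS, start=1):
    frame = render(o)
    history = frame if history is None else history + (frame - history) / n

reference = np.mean([render(o) for o in OFFSETS], axis=0)  # direct 4X FSAA
```

Under camera or object motion the reprojection is no longer the identity, which is exactly where the rejection and bilateral-filtering fallbacks described above come in.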
Our solution proved to be efficient and effective, giving results comparable
to 4X FSAA for near-static scenes and giving results of varying quality during
high-frequency motion. However, pixels in motion were subject to motion blur,
which effectively masked any artifacts produced by our antialiasing solution. In
general, the method definitely proved to be better than 2X FSAA and slightly
worse than 4X FSAA since some high-frequency detail was lost due to repeated
resampling. Furthermore, the computational cost was insignificant compared to
standard FSAA, not to mention that it has lower memory requirements (only one
additional full-resolution buffer for caching). We would like to improve upon
resampling schemes to avoid additional blurring.
High-Quality Spatiotemporal Reconstruction
We would like to present another concept to which the spatiotemporal framework
can be applied. It is similar to the one used in antialiasing. Suppose we want to
draw a full-resolution frame. During each frame, we draw a 1/n-resolution buffer,
called the refresh buffer, with a different pixel offset. We change the pattern for
each frame in order to cover the full frame of information in n frames. The final
image is computed from the refresh buffer and a high-resolution history buffer.
When the pixel being processed is not available in the history or refresh buffer,
we resort to bilateral upsampling from coarse samples. See Figure 7.12 for an
overview of the algorithm. This solution speeds up frame computation by a factor
of n, producing a properly resampled high-resolution image, with the worst-case
per-pixel resolution being 1/n of the original. Resolution loss would be mostly
visible near screen boundaries and near fast-moving objects. However, those arti-
facts may be easily masked by additional processing, like motion blur. We found
that setting n = 4 generally leads to an acceptable solution in terms of quality and
performance. However, a strict rejection and bilateral upsampling policy must be