Home Page Icon
Home Page
Table of Contents for
Digital Audio Signal Processing, 3rd Edition
Close
Digital Audio Signal Processing, 3rd Edition
by Udo Zölzer
Digital Audio Signal Processing, 3rd Edition
Cover
Title Page
Copyright
Preface for the Third Edition
Preface for the Second Edition
Preface for the First Edition
Chapter 1: Introduction
Chapter 2: Quantization
Chapter 3: Sampling Rate Conversion
Chapter 4: AD/DA Conversion
Chapter 5: Audio Processing Systems
Chapter 6: Equalizers
Chapter 7: Room Simulation
Chapter 8: Dynamic Range Control
Chapter 9: Audio Coding
Chapter 10: Nonlinear Processing
Chapter 11: Machine Learning for Audio
Index
End User License Agreement
Search in book...
Toggle Font Controls
Playlists
Add To
Create new playlist
Name your new playlist
Playlist description (optional)
Cancel
Create playlist
Sign In
Email address
Password
Forgot Password?
Create account
Login
or
Continue with Facebook
Continue with Google
Sign Up
Full Name
Email address
Confirm Email Address
Password
Login
Create account
or
Continue with Facebook
Continue with Google
Prev
Previous Chapter
Cover
Next
Next Chapter
Title Page
Table of Contents
Cover
Title Page
Copyright
Preface for the Third Edition
Preface for the Second Edition
Preface for the First Edition
Chapter 1: Introduction
1.1 Continuous‐time Signals and Convolution
1.2 Continuous‐time Fourier Transform and Laplace Transform
1.3 Sampling and Reconstruction
1.4 Discrete‐time Signals and Convolution
1.5 Discrete‐time Fourier Transform and Z‐Transform
1.6 Discrete Fourier Transform
1.7 FIR and IIR Filters
1.8 Adaptive Filters
1.9 Exercises
References
Notes
Chapter 2: Quantization
2.1 Signal Quantization
2.2 Dither
2.3 Spectrum Shaping of Quantization – Noise Shaping
2.4 Number Representation
2.5 JS Applet – Quantization, Dither, and Noise Shaping
2.6 Exercises
References
Note
Chapter 3: Sampling Rate Conversion
3.1 Basics
3.2 Synchronous Conversion
3.3 Asynchronous Conversion
3.4 Interpolation Methods
3.5 Exercises
References
Chapter 4: AD/DA Conversion
4.1 Methods
4.2 AD Converters
4.3 DA Converters
4.4 JS Applet – Oversampling and Quantization
4.5 Exercises
References
Chapter 5: Audio Processing Systems
5.1 Digital Signal Processors
5.2 Digital Audio Interfaces
5.3 Two‐channel Systems
5.4 Multi‐channel Systems
References
Notes
Chapter 6: Equalizers
6.1 Basics
6.2 Recursive Audio Filters
6.3 Non‐recursive Audio Filters
6.4 Multi‐complementary Filter Bank
6.5 Delay‐based Audio Effects
6.6 JS Applet – Audio Filters
6.7 Exercises
References
Chapter 7: Room Simulation
7.1 Basics
7.2 Early Reflections
7.3 Subsequent Reverberation
7.4 Approximation of Room Impulse Responses
7.5 JS Applet – Fast Convolution
7.6 Exercises
References
Chapter 8: Dynamic Range Control
8.1 Basics
8.2 Static Curve
8.3 Dynamic Behavior
8.4 Implementation
8.5 Realization Aspects
8.6 Multiband DRC
8.7 Dynamic Equalizers
8.8 Source‐filter DRC
8.9 JS Applet – Dynamic Range Control
8.10 Exercises
References
Chapter 9: Audio Coding
9.1 Lossless Audio Coding
9.2 Lossy Audio Coding
9.3 Psychoacoustics
9.4 ISO‐MPEG1 Audio Coding
9.5 MPEG‐2 Audio Coding
9.6 MPEG‐2 Advanced Audio Coding
9.7 MPEG‐4 Audio Coding
9.8 Spectral Band Replication
9.9 Constrained Energy Lapped Transform – Gain and Shape Coding
9.10 JS Applet – Psychoacoustics
9.11 Exercises
References
Note
Chapter 10: Nonlinear Processing
10.1 Fundamentals
10.2 Overdrive, Distortion, Clipping
10.3 Nonlinear Filters
10.4 Aliasing and its Mitigation
10.5 Virtual Analog Modeling
10.6 Exercises
References
Chapter 11: Machine Learning for Audio
11.1 Introduction
11.2 Unsupervised and Supervised Learning
11.3 Gradient Descent and Backpropagation
11.4 Applications
11.5 Exercises
References
Index
End User License Agreement
List of Tables
Chapter 1
Table 1.1 Computing the gradient weight
.
Chapter 2
Table 2.1 Bit location and range of values.
Table 2.2 Rounding and truncation of 2s complement numbers.
Table 2.3 Special cases of floating‐point number representation.
Chapter 3
Table 3.1 Counter increments for different sampling rate conversions.
Chapter 5
Table 5.1 DSPs (AD/TI/NXP/STM/Cadence/Xilinx) and applications
Table 5.2 Channel status bytes
Table 5.3 Emphasis field
Table 5.4 Sampling rate field
Table 5.5 Channel mode
Table 5.6 Electrical specifications of professional interfaces
Table 5.7 MADI specifications
Table 5.8 Computer interfaces
Table 5.9 Computer interfaces
Chapter 6
Table 6.1 Transfer functions of lowpass and highpass filters.
Table 6.2 Lowpass/highpass/bandpass filter design.
Table 6.3 Peak filter design with gain
in dB.
Table 6.4 Low‐frequency shelving filter design with gain
in dB.
Table 6.5 High‐frequency shelving filter design with gain
in dB.
Table 6.6 Direct‐form – a) noise transfer function, b) quadratic
norm, and ...
Table 6.7 Gold and Rader – a) noise transfer function, b) quadratic
norm, a...
Table 6.8 Kingsbury – a) noise transfer function, b) quadratic
norm, and c)...
Table 6.9 Zölzer – a) noise transfer function, b) quadratic
norm, and c) ou...
Table 6.10 Commonly used scaling.
Table 6.11 Transition frequencies
and transition bandwidths TB in an 8‐band...
Table 6.12 Memory requirements.
Chapter 9
Table 9.1 Critical bands as given by Zwicker. Modified from [Zwi82].
Table 9.2 Forward prediction in the time and frequency domains.
Chapter 10
Table 10.1 Wave domain representation of common linear circuit elements
Table 10.2 Parallel and series adaptors
Table 10.3 Coefficient matrices and nonlinear functions of common circuit el...
Chapter 11
Table 11.1 Average performance of the denoising CNNs on noisy speech and mus...
List of Illustrations
Chapter 1
Figure 1.1 Audio capturing and reproduction for a listener, and representati...
Figure 1.2 Continuous‐time signals
,
,
,
,
, and
.
Figure 1.3 Continuous‐time convolution
showing the folded version of the i...
Figure 1.4 Fourier transforms of an even and a causal rect signal. The small...
Figure 1.5 Fourier transforms of two even sinc signals.
Figure 1.6 Sampling and reconstruction – Time‐domain signals (left column) a...
Figure 1.7 Discrete‐time signals
,
,
,
,
, and
.
Figure 1.8 Discrete‐time convolution
showing the folded version of the imp...
Figure 1.9 Fourier transform of an audio signal
(top plot) and its magnitu...
Figure 1.10 Block diagram of FIR filter.
Figure 1.11 FIR filter impulse response, magnitude response, phase response,...
Figure 1.12 Block diagram of an IIR filter.
Figure 1.13 IIR filter cascade.
Figure 1.14 IIR filter and impulse response, magnitude response, phase respo...
Figure 1.15 Frequency responses and impulse responses of a moving average FI...
Figure 1.16 Frequency response of IIR filter.
Figure 1.17 Adaptive signal approximation.
Figure 1.18 Linear prediction.
Figure 1.19 Linear prediction for coding and source‐filter processing.
Chapter 2
Figure 2.1 AD conversion and quantization.
Figure 2.2 Quantization.
Figure 2.3 (a) Nonlinear characteristic curve of a quantizer. (b) Quantizati...
Figure 2.4 Probability density function (sinusoidal signal and signal with u...
Figure 2.5 Probability density function (signal with Gaussian PDF).
Figure 2.6 Amplitude and time quantization.
Figure 2.7 Probability density function of signal
and quantized signal
....
Figure 2.8 Zone sampling of the PDF.
Figure 2.9 Determining the PDF of the output.
Figure 2.10 Spectral representation.
Figure 2.11 PDF and characteristic function of quantization error.
Figure 2.12 PDF of the model.
Figure 2.13 Probability density function and quantization error.
Figure 2.14 Probability density function of the quantization error.
Figure 2.15 Autocorrelation
and power density spectrum
of quantization e...
Figure 2.16 (a) Probability density function of quantization error for diffe...
Figure 2.17 Addition of a random sequence before a quantizer.
Figure 2.18 Specification of the word length.
Figure 2.19 Truncation – linearizing and suppression of noise modulation (
,...
Figure 2.20 Rounding – linearizing and suppression of noise modulation (
,
Figure 2.21 Normalized power density spectrum for triangular PDF dither (TRI...
Figure 2.22 (a,d) Histogram and (c,d) power density spectrum of uniform PDF ...
Figure 2.23 One‐bit amplitude – quantizer with truncation (a,c) and rounding...
Figure 2.24 One‐bit amplitude – rounding with RECT dither (a,c) and TRI dith...
Figure 2.25 Noise modulation at 0.25‐bit amplitude – rounding with RECT dith...
Figure 2.26 Linear model of a quantizer.
Figure 2.27 Spectrum shaping of quantization error.
Figure 2.28 Highpass spectrum shaping of quantization error.
Figure 2.29 Spectrum shaping (
,
;
, ;
, ‐ ‐ ‐).
Figure 2.30 Dither and spectrum shaping.
Figure 2.31 Modified dither and spectrum shaping.
Figure 2.32 (a) Hearing thresholds in quiet. (b) Inverse ISO 389‐7 threshold...
Figure 2.33 Power density spectra of three filter approximations (Wa3, third...
Figure 2.34 Psychoacoustic noise shaping: signal
; quantized signal
; and ...
Figure 2.35 Fixed‐point formats.
Figure 2.36 Number range.
Figure 2.37 Rounding and truncation curves.
Figure 2.38 Rounding curve and error signal for
bit.
Figure 2.39 Model of a fixed‐point quantizer.
Figure 2.40 Floating‐point number representation.
Figure 2.41 Rounding and truncation curves for floating‐point representation...
Figure 2.42 Model of a floating‐point quantizer.
Figure 2.43 Signal‐to‐noise ratio for an input level.
Figure 2.44 JS applet – quantization, dither, and noise shaping.
Chapter 3
Figure 3.1 Upsampling by L and anti‐imaging filtering in the time and freque...
Figure 3.2 Antialiasing filtering and downsampling by M in the time and freq...
Figure 3.3 Sampling rate conversion by factor L/M.
Figure 3.4 Identities for sampling rate conversion.
Figure 3.5 Decomposition in accordance with Euclid's theorem.
Figure 3.6 Polyphase decomposition for downsampling L/M
.
Figure 3.7 Polyphase decomposition for upsampling L/M
.
Figure 3.8 Sampling rate conversion by factor
.
Figure 3.9 Approximation of DA/AD conversions.
Figure 3.10 Linear interpolation before virtual sample‐and‐hold function.
Figure 3.11 Convolution sum (3.42) in the time domain.
Figure 3.12 Convolution sum (3.42) for different
.
Figure 3.13 Sinc function and different impulse responses.
Figure 3.14 Time‐scaled impulse response.
Figure 3.15 Multistage conversion – frequency‐domain interpretation.
Figure 3.16 Time‐domain interpretation.
Figure 3.17 Sampling rate conversion with interpolation for calculating coef...
Figure 3.18 Calculation of
.
Figure 3.19 Measurement of
.
Figure 3.20 Polynomial interpolation with three samples.
Figure 3.21 Lagrange polynomial.
Figure 3.22 Truncated power functions and the B‐spline of Nth order.
Figure 3.23 Third‐order B‐spline (
,
, five samples).
Figure 3.24 Interpolation with B‐splines of second and third orders.
Figure 3.25 Exploiting the symmetry properties of a second‐order B‐spline.
Figure 3.26 Interpolation with B‐splines of fourth and sixth order.
Chapter 4
Figure 4.1 Schematic diagram of Nyquist sampling.
Figure 4.2 Nyquist sampling – interpretation in the frequency domain.
Figure 4.3 Influence of oversampling and delta‐sigma technique on power spec...
Figure 4.4 Oversampling AD converter and sampling rate reduction.
Figure 4.5 Oversampling and DA conversion.
Figure 4.6 Delta modulation and displacement of integrator.
Figure 4.7 Delta‐sigma modulation and time‐discrete model.
Figure 4.8 Signals in delta‐sigma modulation.
Figure 4.9 Oversampling delta‐sigma AD converter.
Figure 4.10 Oversampling delta‐sigma DA converter.
Figure 4.11 Time‐discrete model of a first‐order delta‐sigma modulator.
Figure 4.12 Time‐discrete model of a second‐order delta‐sigma modulator.
Figure 4.13 Time‐discrete model of a multistage delta‐sigma modulator.
Figure 4.14
with
.
Figure 4.15 Improvement of signal‐to‐noise ratio as a function of oversampli...
Figure 4.16 Higher‐order delta‐sigma modulator.
Figure 4.17 Comparison of different transfer functions of the error signal....
Figure 4.18 Transfer function of the error signal in the stopband.
Figure 4.19 Several stages for sampling rate reduction.
Figure 4.20 Signal flow diagram of a comb filter.
Figure 4.21 Comb filter for sampling rate reduction.
Figure 4.22 Series of comb filters for sampling rate reduction.
Figure 4.23 (a) Transfer function
with
. (b) Third‐order delta‐sigma modu...
Figure 4.24 (a) Sample‐and‐hold circuit. (b) Input and output with clock sig...
Figure 4.25 Offset error and gain error.
Figure 4.26 Differential nonlinearity.
Figure 4.27 Integral nonlinearity.
Figure 4.28 Monotonicity.
Figure 4.29 Parallel converter.
Figure 4.30 Half‐flash AD converter.
Figure 4.31 Subranging AD converter.
Figure 4.32 AD converter with successive approximation.
Figure 4.33 Successive approximation.
Figure 4.34 AD converter with forward‐backward counter.
Figure 4.35 Single‐slope AD converter.
Figure 4.36 Dual‐slope AD converter.
Figure 4.37 Delta‐sigma AD converter.
Figure 4.38 Settling time and sample‐and‐hold function.
Figure 4.39 Offset and gain error.
Figure 4.40 Differential nonlinearity.
Figure 4.41 Integral nonlinearity.
Figure 4.42 Monotonicity.
Figure 4.43 Switched voltage sources.
Figure 4.44 Switched current sources.
Figure 4.45 Weighted current sources.
Figure 4.46 DA conversion with weighted resistors.
Figure 4.47 DA conversion with weighted capacitors.
Figure 4.48 Switched current sources with R–2R resistor network.
Figure 4.49 Delta‐sigma DA converter.
Figure 4.50 JS applet – oversampling and quantization.
Chapter 5
Figure 5.1 Schematic diagram of a fixed‐point DSP.
Figure 5.2 Block diagram of a floating‐point digital signal processor.
Figure 5.3 Two‐channel format.
Figure 5.4 Two‐channel format (subframe).
Figure 5.5 Channel coding.
Figure 5.6 Preamble X.
Figure 5.7 Bytes 0–2 of channel status information.
Figure 5.8 Bytes 0–3 (consumer format).
Figure 5.9 A system link by MADI.
Figure 5.10 MADI frame format.
Figure 5.11 HDMI block diagram.
Figure 5.12 HDMI pinout since HDMI 1.4.
Figure 5.13 Example layer‐1 network (AES50 replaces the traditional multicor...
Figure 5.14 Example layer‐2 network (the PC cannot receive the AVB stream be...
Figure 5.15 Example layer‐3 network (all devices in the network can connect ...
Figure 5.16 Audio over IP.
Figure 5.17 Two‐channel DSP system with two‐channel AD/DA converters (C = co...
Figure 5.18 Mixing console application of a multichannel system.
Chapter 6
Figure 6.1 Linear magnitude responses of lowpass, highpass, bandpass, and ba...
Figure 6.2 Logarithmic magnitude responses of bandpass filters with constant...
Figure 6.3 Linear magnitude responses of octave filters and decomposition of...
Figure 6.4 Parallel connection of bandpass filters (BP) for octave/one‐third...
Figure 6.5 Series connection of shelving and peak filters (low‐frequency LF,...
Figure 6.6 Magnitude responses of weighting filters for root‐mean‐square and...
Figure 6.7 Pole‐zero location for (a) second‐order lowpass and (b) second‐or...
Figure 6.8 Frequency response of lowpass and highpass filters – highpass
=...
Figure 6.9 Frequency response of transfer function (6.12) with varying
and...
Figure 6.10 Frequency responses of transfer function (6.13) with varying
a...
Figure 6.11 Pole‐zero locations of a first‐order low‐frequency shelving filt...
Figure 6.12 Pole‐zero locations of a second‐order low‐frequency shelving fil...
Figure 6.13 Pole‐zero locations of second‐order high‐frequency shelving filt...
Figure 6.14 Frequency responses of second‐order low‐/high‐frequency shelving...
Figure 6.15 Pole‐zero locations of a second‐order peak filter.
Figure 6.16 Frequency response of a peak filter –
= 500 Hz,
= 1.25, cut ...
Figure 6.17 Frequency responses of peak filters –
= 500 Hz, boost/cut
dB...
Figure 6.18 Frequency responses of peak filters – boost/cut
dB,
= 1.25,
Figure 6.19 Filter structure for implementing boost and cut filters.
Figure 6.20 Filter structures by Regalia.
Figure 6.21 Low‐frequency shelving filter and first‐order lowpass filter.
Figure 6.22 First‐order high‐frequency shelving and highpass filters.
Figure 6.23 Second‐order peak filter and bandpass filter.
Figure 6.24 Low‐frequency first‐order shelving filter (
dB;
= 20, 50, 100...
Figure 6.25 First‐order high‐frequency shelving filter (
dB;
= 1, 3, 5, 1...
Figure 6.26 Second‐order peak filter (
dB;
= 50, 100, 1000, 3000, 10000 H...
Figure 6.27 Direct‐form structure – pole distribution (6‐bit quantization)....
Figure 6.28 Direct‐form structure – block diagram of recursive part.
Figure 6.29 Gold and Rader – pole distribution (6‐bit quantization).
Figure 6.30 Gold and Rader – block diagram of recursive part.
Figure 6.31 Kingsbury – pole distribution (6‐bit quantization).
Figure 6.32 Kingsbury – block diagram of recursive part.
Figure 6.33 Geometric interpretation.
Figure 6.34 Zölzer – pole distribution (6‐bit quantization).
Figure 6.35 Zölzer – block diagram of recursive part.
Figure 6.36 Direct form with additive error signal.
Figure 6.37 Gold and Rader structure with additive error signals.
Figure 6.38 Kingsbury structure with additive error signals.
Figure 6.39 Zölzer structure with additive error signals.
Figure 6.40 SNR versus cutoff frequency – quantization of products (
Hz).
Figure 6.41 SNR versus cutoff frequency – quantization of products (
kHz)....
Figure 6.42 Direct‐form filter – quantization after accumulator.
Figure 6.43 Gold and Rader filter – quantization after accumulator.
Figure 6.44 Kingsbury filter – quantization after accumulator.
Figure 6.45 Zölzer filter – quantization after accumulator.
Figure 6.46 SNR versus cutoff frequency – quantization after accumulator (
...
Figure 6.47 SNR versus cutoff frequency – quantization after accumulator (
...
Figure 6.48 Direct‐form with noise shaping.
Figure 6.49 SNR – Noise shaping in direct‐form filter structures.
Figure 6.50 Gold and Rader filter with noise shaping.
Figure 6.51 Kingsbury filter with noise shaping.
Figure 6.52 Zölzer filter with noise shaping.
Figure 6.53 SNR – noise shaping (20–200 Hz).
Figure 6.54 SNR – noise shaping (200 Hz–12 kHz).
Figure 6.55 Fast convolution of signal
of length
and impulse response
...
Figure 6.56 Fast convolution with partitioning of the input signal
into bl...
Figure 6.57 Partitioning of the impulse response
.
Figure 6.58 Scheme for a fast convolution with
.
Figure 6.59 Hybrid fast convolution.
Figure 6.60 Filter design by frequency sampling (
even).
Figure 6.61 Octave‐band QMF bank (SP = signal processing, LP = lowpass, HP =...
Figure 6.62 Octave‐frequency bands.
Figure 6.63 Two‐band decomposition.
Figure 6.64 Modified octave‐band filter bank.
Figure 6.65 Modified octave decomposition.
Figure 6.66 Two‐band complementary filter bank.
Figure 6.67 Design of
,
, and
.
Figure 6.68 Multi‐complementary filter bank.
Figure 6.69 Multirate complementary filter.
Figure 6.70 Modified octave decomposition of the frequency band.
Figure 6.71 Linear phase 8‐band equalizer.
Figure 6.72 Kernel complementary filter structure.
Figure 6.73 Decimation and interpolation filters.
Figure 6.74 Kernel lowpass filter with a transition bandwidth of
Hz.
Figure 6.75 Decimation and interpolation highpass filter.
Figure 6.76 Decimation and interpolation lowpass filter.
Figure 6.77 Phase modulation with delay line.
Figure 6.78 Tremolo, vibrato, and chorus effect.
Figure 6.79 JS applet – audio filters.
Chapter 7
Figure 7.1 Room impulse response
and simplified decomposition into direct ...
Figure 7.2 Model‐based methods for calculating room impulse responses.
Figure 7.3 Two‐dimensional representation of the image source model highligh...
Figure 7.4 Simulated room impulse response via image source model with a len...
Figure 7.5 Measurement of room impulse response with pseudo‐random signal
....
Figure 7.6 Periodic auto‐correlation of pseudo‐random sequence and periodic ...
Figure 7.7 Beginning of an exponential sine sweep
with
,
, and
.
Figure 7.8 Spectrogram of an exponential sine sweep with
,
, and
.
Figure 7.9 Magnitude responses of the exponential sine sweep
, the inverse ...
Figure 7.10 Result of the convolution of the exponential sine sweep
and th...
Figure 7.11 Exemplary measured room impulse response including the first and...
Figure 7.12 Simulation of early reflections.
Figure 7.13 Early reflections.
Figure 7.14 Delay and weighting of the direct signal.
Figure 7.15 Delay and weighting of effect signal.
Figure 7.16 Coupled factors and delays.
Figure 7.17 Multichannel application.
Figure 7.18 Stereo reflections.
Figure 7.19 Increase of density for nine reflections.
Figure 7.20 Recursive comb filter (
feedback factor,
delay length).
Figure 7.21 Allpass filter (
delay length).
Figure 7.22 (a) Impulse response of a comb filter (
= 10,
= −0.6). (b) Im...
Figure 7.23 Parallel circuit of comb filters.
Figure 7.24 Cascade circuit of allpass filters.
Figure 7.25 Short‐time spectra of a comb filter (
).
Figure 7.26 Short‐time spectra of a parallel circuit of comb filters.
Figure 7.27 Impulse response and echogram.
Figure 7.28 Modified lowpass comb filter.
Figure 7.29 Short‐time spectra of a parallel circuit of lowpass comb filters...
Figure 7.30 Stereo room simulation.
Figure 7.31 General feedback system.
Figure 7.32 Feedback system.
Figure 7.33 Impulse response and frequency response of 4‐delay system with a...
Figure 7.34 Impulse response and frequency response of a 4‐delay system with...
Figure 7.35 Room simulation with delay line and forward and backward coeffic...
Figure 7.36 Embedded and absorbing allpass system [Gar92a, Gar92b, Gar92c, G...
Figure 7.37 Room simulator with embedded allpass systems [Gar92a, Gar92b, Ga...
Figure 7.38 Stereo room simulator with absorbent allpass systems. Modified f...
Figure 7.39 Simplified feedback delay network containing parallel allpass/de...
Figure 7.40 System measuring and approximating room impulse responses.
Figure 7.41 Determining model parameters from the measured impulse response....
Figure 7.42 JS applet – fast convolution.
Chapter 8
Figure 8.1 System for dynamic range control.
Figure 8.2 Static curve with the parameters: LT =
limiter threshold
; CT =
co
...
Figure 8.3 Compressor curve (compressor ratio CR/compressor slope CS).
Figure 8.4 Static curve and gain mapping curve of a compressor with soft and...
Figure 8.5 PEAK measurement.
Figure 8.6 Block diagrams of the attack and release parts of the branched pe...
Figure 8.7 Block diagram of the decoupled peak level detector.
Figure 8.8 Attack and release behavior of branching and decoupled peak level...
Figure 8.9 RMS measurement (TAV = averaging coefficient).
Figure 8.10 Implementing attack and release times or gain factor smoothing....
Figure 8.11 Attack and release behavior for time constant filters.
Figure 8.12 Limiter.
Figure 8.13 Feedback DRC System.
Figure 8.14 Ducking DRC system.
Figure 8.15 DRC system with a lookahead of
samples.
Figure 8.16 Compressor/expander/noise gate.
Figure 8.17 Limiter/compressor/expander/noise gate.
Figure 8.18 Shifting the static curve by a gain factor.
Figure 8.19 Signals
,
, and
for dynamic range control.
Figure 8.20 Dynamic system with sampling rate reduction.
Figure 8.21 Nesting technique.
Figure 8.22 Stereo dynamic system.
Figure 8.23 Multiband DRC system.
Figure 8.24 Dynamic EQ system.
Figure 8.25 Frequency response of a dynamic equalizer at sample
.
Figure 8.26 Block diagram of a low‐shelving dynamic filter.
Figure 8.27 Block diagram of a peak dynamic filter.
Figure 8.28 Source‐filter separation and processing using linear predictive ...
Figure 8.29 Block diagram of the combined systems LPC and DRC.
Figure 8.30 Error signals
and
before and after dynamic range expansion, ...
Figure 8.31 Spectrograms of a female singing voice with white noise (30 dB S...
Figure 8.32 Block diagram of the DRC block for transient control.
Figure 8.33 Input signal
and the corresponding extracted transients.
Figure 8.34 Spectrogram of a female singing voice (top left, input; top righ...
Figure 8.35 JS applet – dynamic range control.
Chapter 9
Figure 9.1 Lossless audio coding based on linear prediction and entropy codi...
Figure 9.2 Signals and spectra for linear prediction.
Figure 9.3 Lossless audio coding (Mozart, Stravinsky): word length in bits v...
Figure 9.4 Lossy audio coding based on sub‐band coding and psychoacoustic mo...
Figure 9.5 Absolute threshold (threshold in quiet).
Figure 9.6 Masking threshold of band‐limited noise.
Figure 9.7 Offset between signal level and masking threshold.
Figure 9.8 Masking within a critical band. Based on [Thei88] and [Sauv90].
Figure 9.9 Masking across critical bands.
Figure 9.10 Stepwise calculation of psychoacoustic model.
Figure 9.11 Calculation of the signal‐to‐mask ratio SMR.
Figure 9.12 Calculation of psychoacoustic model for a pure sinusoid with 440...
Figure 9.13 Simplified block diagram of an ISO‐MPEG1 coder.
Figure 9.14 Simplified block diagram of an ISO‐MPEG1 decoder.
Figure 9.15 Pseudo‐QMF bank.
Figure 9.16 Impulse responses and magnitude responses of pseudo‐QMF bank.
Figure 9.17 Polyphase/MDCT hybrid filter bank.
Figure 9.18 Nomenclature of frequency indices.
Figure 9.19 MPEG‐2 AAC coder and decoder.
Figure 9.20 Time‐frequency decompostion with MDCT/inverse modified discrete ...
Figure 9.21 Signals of MDCT/IMDCT.
Figure 9.22 Kaiser–Bessel‐derived window and sine window for
and magnitude...
Figure 9.23 Normalized impulse responses of sine window for
, modulated ban...
Figure 9.24 Switching of window functions.
Figure 9.25 Attack of castanet and spectrum.
Figure 9.26 Forward prediction in time and frequency domains.
Figure 9.27 Temporal noise shaping with forward prediction in the frequency ...
Figure 9.28 Temporal noise shaping: attack of castanet and spectrum.
Figure 9.29 Backward prediction of bandpass signals.
Figure 9.30 M/S coding in the frequency domain.
Figure 9.31 Intensity stereo coding in the frequency domain.
Figure 9.32 MPEG‐4 parametric coder.
Figure 9.33 Parameter extraction with analysis/synthesis.
Figure 9.34 Original signal, sum of sinusoids, and noise‐like signal.
Figure 9.35 Spectrogram of sinusoidal components.
Figure 9.36 MPEG‐4 parametric decoder.
Figure 9.37 SBR coder.
Figure 9.38 SBR decoder. Based on [Edl00] and [Mei02].
Figure 9.39 Functional units of the SBR method.
Figure 9.40 A simplified illustration of the CELT codec.
Figure 9.41 CELT spectrograms for a) energies per Bark band, b) gain coeffic...
Figure 9.42 Mean gain and mean predictive gain coefficients, averaged over a...
Figure 9.43 Actual and ideal distribution of the coarse quantization symbols...
Figure 9.44 Distribution of the fine quantization symbols for
frames.
Figure 9.45 Performance of the CELT codec on an audio snippet.
Figure 9.46 JS applet – psychoacoustics.
Chapter 10
Figure 10.1 Comparison of a linear system and a nonlinear system when excite...
Figure 10.2 Effect of a static nonlinearity on a single sinusoid, two sinuso...
Figure 10.3 First‐order diode clipper.
Figure 10.4 Combination of linear filtering and nonlinear static mapping.
Figure 10.5 Static characteristic curve of a symmetrical soft‐clipping (left...
Figure 10.6 Output signal for sinusoidal input with soft‐clipping (left) and...
Figure 10.7 Wavefolding using nonlinear static mapping functions.
Figure 10.8 Linear (left) and nonlinear (right) second‐order filter.
Figure 10.9 Hyperbolic tangent mapping function.
Figure 10.10 Frequency response of linear second‐order filter.
Figure 10.11 Waterfall presentation of a linear filtered sinesweep.
Figure 10.12 Waterfall presentation of a nonlinear filtered sinesweep.
Figure 10.13 Spectra of (a) continuous‐time signal
with a marker at the Ny...
Figure 10.14 Operation of a nonlinear system at a sampling frequency increas...
Figure 10.15 Spectra of
for different values of
.
Figure 10.16 Spectra of output obtained with antiderivative antialiasing for...
Figure 10.17 Second‐order diode clipper.
Figure 10.18 Wave digital filter structure of second‐order diode clipper wit...
Chapter 11
Figure 11.1 A minimal illustration of unsupervised and supervised learning m...
Figure 11.2 A feedforward neural network with one hidden layer.
Figure 11.3 Activation functions.
Figure 11.4 A convolutional neural network with one hidden layer.
Figure 11.5 A cascaded structure of peak and shelving filters.
Figure 11.6 Signal flow graph of a LF/HF shelving filter.
Figure 11.7 Signal flow graph of a peak filter.
Figure 11.8 Block diagram of the model.
Figure 11.9 Illustration of backpropagation through the cascaded structure....
Figure 11.10 Block diagram illustrating the initialization method for HRTF m...
Figure 11.11 Magnitude responses: in the top plot, the whole filter cascade;...
Figure 11.12 Feedback delay network.
Figure 11.13 Results, in terms of energy decay curves (EDCs) and the final e...
Figure 11.14 Results in terms of EDCs and the final estimated RIR of a room ...
Figure 11.15 Illustration of the denoising model during training phase (top ...
Figure 11.16 Illustration of the CNN architecture, where C0, C1, and C2 deno...
Figure 11.17 Example of noise suppression in an audio file from the PTDB‐TUG...
Figure 11.18 Example of noise suppression in an audio file from the Mirex‐Su...
Figure 11.19 Spectrogram of the example noisy, denoised, and original audio ...
Guide
Cover Page
Table of Contents
Title Page
Copyright
Preface for the Third Edition
Preface for the Second Edition
Preface for the First Edition
Begin Reading
Index
Wiley End User License Agreement
Pages
iv
viii
ix
x
xi
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
401
402
403
404
405
Add Highlight
No Comment
..................Content has been hidden....................
You can't read the all page of ebook, please click
here
login for view all page.
Day Mode
Cloud Mode
Night Mode
Reset