A
ABC (Atanasoff Berry Computer), L-5
Absolute addressing mode, Intel 80x86, K-47
Accelerated Strategic Computing Initiative (ASCI)
system area network history, F-101
Access 1/Access 2 stages, TI 320C55 DSP, E-7
Access bit
IA-32 descriptor table,
B-52
Access time gap, disk storage, D-3
Acknowledgment, packets, F-16
ACS project, L-28 to L-29
Active low power modes, WSCs,
472
Ada language, integer division/remainder,
J-12
Adaptive routing
vs. deterministic routing, F-52 to F-55,
F-54
network fault tolerance, F-94
and overhead, F-93 to F-94
Adders
carry-lookahead, J-37 to J-41
integer division speedup, J-54 to J-58
integer multiplication speedup
many adders,
J-50, J-50 to J-54
multipass array multiplier,
J-51
signed-digit addition table,
J-54
single adder, J-47 to J-49,
J-48 to J-49
radix-4 SRT division,
J-57
time/space requirements,
J-44
Addition operations
integer, speedup
carry-lookahead, J-37 to J-41
carry-lookahead circuit,
J-38
carry-lookahead tree,
J-40
carry-lookahead tree adder,
J-41
carry-select adder,
J-43, J-43 to J-44,
J-44
carry-skip adder, J-41 to J43,
J-42
ripply-carry addition,
J-3
Address aliasing prediction
ILP for realizable processors,
216
Address Coalescing Unit
Multithreaded SIMD Processor block diagram,
294
Address fault, virtual memory definition,
B-42
Addressing modes
compiler writing-architecture relationship,
A-30
desktop architectures,
K-5
embedded architectures,
K-6
instruction set encoding,
A-21
Intel 80x86, K-47 to K-49,
K-58 to K-59, K-59 to K-60
Intel 80x86 operands, K-59
MIPS data transfers,
A-34
RISC architectures, K-5 to K-6
VAX instruction encoding, K-68 to K-69
Address offset, virtual memory,
B-56
Address space
Multimedia SIMD
vs. GPUs,
312
SMP/DSM shared memory,
348
Address specifier
instruction set encoding,
A-21
VAX instruction encoding, K-68 to K-69
Address stage, TI 320C55 DSP, E-7
Address trace, cache performance,
B-4
Address translation
memory hierarchy basics,
77–78
virtual memory definition,
B-42
virtual memory protection,
106
Administrative costs, WSC
vs. datacenters,
455
Adobe Photoshop, multimedia support, K-17
Advanced directory protocol
Advanced load address table (ALAT)
vector sparse matrices, G-13
Advanced loads, IA-64 ISA, H-40
Advanced mobile phone service (AMPS), cell phones, E-25
Advanced Simulation and Computing (ASC) program, system area network history, F-101
Advanced Switching Interconnect (ASI), storage area network history, F-103
Advanced Switching SAN, F-67
Advanced Vector Extensions (AVX)
double-precision FP programs,
284
vs. vector architectures,
282
After rounding rule, J-36
Aggregate bandwidth
effective bandwidth calculations, F-18 to F-19
interconnection networks, F-89
shared-
vs. switched-media networks,
F-22, F-24 to F-25
switched-media networks, F-24
switch microarchitecture, F-56
Aiken, Howard, L-3 to L-4
Airside econimization, WSC cooling systems,
449
Akamai, as Content Delivery Network,
460
Aliases, address translation,
B-38
Alliant processors, vector processor history, G-26
AltaVista search
cluster history, L-62, L-73
shared-memory workloads,
369,
370
Amazon Elastic Computer Cloud (EC2),
456–457
MapReduce cost calculations,
458–459
price and characteristics,
458
Amazon Simple Storage Service (S3),
456–457
Amazon Web Services (AWS)
WSC cost-performance,
474
Amdahl’s law
computer design principles,
46–48
computer system power consumption case study,
63–64
parallel processing calculations,
349–350
vs. processor performance equation,
51
WSC processor cost-performance,
472–473
AMD Athlon 64, Itanium 2 comparison,
H-43
AMD Barcelona microprocessor, Google WSC server,
467
AMD Opteron
address translation,
B-38
misses per instruction,
B-15
multicore processor performance,
400–401
multilevel exclusion,
B-35
NetApp FAS6000 filer, D-42
vs. Pentium protection,
B-57
real-world server considerations,
52–55
server energy savings,
25
TLB during address translation,
B-47
AMD processors
architecture flaws
vs. success,
A-45
GPU computing history, L-52
shared-memory multiprogramming workload,
378
tournament predictors,
164
Amortization of overhead, sorting case study, D-64 to D-67
Annulling delayed branch, instructions,
K-25
Antenna, radio receiver,
E-23
Antialiasing, address translation,
B-38
Antidependences
compiler history, L-30 to L-31
loop-level parallelism calculations,
320
Apple iPad
memory hierarchy basics,
78
Application binary interface (ABI), control flow instructions,
A-20
Application layer, definition,
F-82
Arbitration algorithm
collision detection, F-23
commercial interconnection networks,
F-56
interconnection networks, F-21 to F-22, F-27, F-49 to F-50
network impact, F-52 to F-55
SAN characteristics,
F-76
switched-media networks, F-24
switch microarchitecture, F-57 to F-58
switch microarchitecture pipelining, F-60
system area network history, F-100
Architecturally visible registers, register renaming
vs. ROB,
208–209
Architectural Support for Compilers and Operating Systems (ASPLOS), L-11
Areal density, disk storage, D-2
Argument pointer, VAX, K-71
Arithmetic/logical instructions
desktop RISCs,
K-11,
K-22
embedded RISCs,
K-15,
K-24
Arithmetic-logical units (ALUs)
basic MIPS pipeline,
C-36
branch condition evaluation,
A-19
DSP media extensions, E-10
effective address cycle,
C-6
hardware-based execution,
185
integer multiplication, J-48
integer shifting over zeros, J-45 to J-46
ISA performance and efficiency prediction,
241
microarchitectural techniques case study,
253
operands per instruction example,
A-6
RISC classic pipeline,
C-7
RISC instruction set,
C-4
ARM (Advanced RISC Machine)
addressing modes, K-5,
K-6
arithmetic/logical instructions,
K-15,
K-24
condition codes, K-12 to K-13
control flow instructions,
14
data transfer instructions,
K-23
embedded instruction format,
K-8
GPU computing history, L-52
multiply-accumulate,
K-20
RISC instruction set lineage,
K-43
unique instructions, K-36 to K-37
ARM Cortex-A8
ISA performance and efficiency prediction,
241–243
memory access penalty,
117
processor comparison,
242
ARM Thumb
arithmetic/logical instructions,
K-24
data transfer instructions,
K-23
embedded instruction format,
K-8
multiply-accumulate,
K-20
unique instructions, K-37 to K-38
ARPA (Advanced Research Project Agency)
LAN history, F-99 to F-100
ARPANET, WAN history, F-97 to F-98
Arrays
bubble sort procedure,
K-76
cluster server outage/anomaly statistics,
435
Layer 3 network linkage,
445
loop-level parallelism dependences,
318–319
ocean application, I-9 to I-10
WSC memory hierarchy,
445
ASCII character format,
12,
A-14
Association of Computing Machinery (ACM), L-3
Asynchronous I/O, storage systems, D-35
Asynchronous Transfer Mode (ATM)
interconnection networks, F-89
total time statistics,
F-90
ATA (Advanced Technology Attachment) disks
Berkeley’s Tertiary Disk project, D-12
historical background, L-81
server energy savings,
25
Atanasoff Berry Computer (ABC), L-5
Atomic instructions
barrier synchronization,
I-14
T1 multithreading unicore performance,
229
Atomicity-consistency-isolation-durability (ACID),
vs. WSC storage,
439
Atomic operations
snooping cache coherence implementation,
365
“Atomic swap,” definition, K-20
Attributes field, IA-32 descriptor table,
B-52
Autoincrement deferred addressing, VAX, K-67
Availability
commercial interconnection networks, F-66
computer architecture,
11,
15
computer systems, D-43 to D-44,
D-44
I/O system design/evaluation, D-36
mainstream computing classes,
open-source software,
457
as server characteristic,
Average instruction execution time, L-6
Average Memory Access Time (AMAT)
centralized shared-memory architectures,
351–352
memory hierarchy basics,
75–76
miss penalty reduction,
B-32
Average reception factor
centralized switched networks, F-32
multi-device interconnection networks, F-26