i
i
i
i
i
i
i
i
678 27. Visualization
Display capacity is a third kind of limitation to consider. Visualization de-
signers often “run out of pixels,” where the resolution of the screen is not large
enough to show all desired information simultaneously. The information density
of a particular frame is a measure of the amount of information encoded versus
the amount of unused space. There is a tradeoff between the benefits of showing
as much as possible at once, to minimize the need for navigation and exploration,
and the costs of showing too much at once, where the user is overwhelmed by
visual clutter.
27.2 Data Types
Many aspects of a visualization design are driven by the type of the data that we
need to look at. For example, is it a table of numbers, or a set of relations between
items, or inherently spatial data such as a location on the Earth’s surface or a
collection of documents?
We start by considering a table of data. We call the rows items of data and the
columns are dimensions, also known as attributes. For example, the rows might
represent people, and the columns might be names, age, height, shirt size, and
favorite fruit.
We distinguish between three types of dimensions: quantitative, ordered, and
categorical. Quantitative data, such as age or height, is numerical and we can
do arithmetic on it. For example, the quantity of 68 inches minus 42 inches is
26 inches. With ordered data, such as shirt size, we cannot do full-fledged arith-
metic, but there is a well-defined ordering. For example, Large minus Medium
is not a meaningful concept, but we know that Medium falls between Small and
Large. Categorical data, such as favorite fruit or names, does not have an implicit
ordering. We can only distinguish whether two things are the same (apples) or
different (apples vs. bananas).
Relational data, or graphs, are another data type where nodes are connected by
links. One specific kind of graph is a tree, which is typically used for hierarchical
data. Both nodes and edges can have associated attributes. The word graph is
unfortunately overloaded in visualization. The node-link graphs we discuss here,
following the terminology of graph drawing and graph theory, could also be called
networks.Inthefield of statistical graphics, graph is often used for chart,asin
the line charts for time-series data shown in Figure 27.10.
Some data is inherently spatial, such as geographic location or a field of mea-
surements at positions in three-dimensional space as in the MRI or CT scans used
by doctors to see the internal structure of a person’s body. The information as-
sociated with each point in space may be an unordered set of scalar quantities,