238 ◾ Roque Perez-Velez
For example, an engineer wants to determine which radiological procedures are requested
most. Table 27.4 shows 16 of the 43 most common radiological procedures, sorted in descending
order of total requests, and their respective cumulative percentages. The engineer used this large
data set to create the Pareto graph shown in Figure 27.5.
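
The percentage and cumulative percentage columns behind a Pareto graph can be sketched in a few lines of Python. This is a minimal illustration, not the method used to build Table 27.4: the function name `pareto_table` and the three sample counts are hypothetical, and the percentages are taken against the sum of the counts supplied.

```python
from typing import Dict, List, Tuple


def pareto_table(counts: Dict[str, int]) -> List[Tuple[str, int, float, float]]:
    """Return rows of (name, count, percent, cumulative percent), largest count first."""
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("counts must sum to a positive number")
    rows = []
    running = 0
    # Sort in descending order of count, as a Pareto graph requires.
    for name, count in sorted(counts.items(), key=lambda kv: kv[1], reverse=True):
        running += count
        rows.append((name, count, 100.0 * count / total, 100.0 * running / total))
    return rows


# Hypothetical counts for illustration only (not the data in Table 27.4).
example = {"B": 30, "A": 50, "C": 20}
for name, count, pct, cum in pareto_table(example):
    print(f"{name:<4}{count:>5}{pct:>7.1f}%{cum:>7.1f}%")
```

The bars of the Pareto graph correspond to the percent column and the rising line to the cumulative percent column.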
Data mining is the process of handling, managing, and analyzing extremely large sets of data.
Because such data sets are complex to work with, they must usually be sampled. Take, for
example, a bed control manager who wants to better predict how bed usage is affected by
patients who are transferred into his institution. The manager may want to look at daily
transfer reports, but daily transfers can be cumbersome and difficult to visualize on a graph.
Instead, the manager may sample weekly transfers and combine a box plot and a linear
graph to gain the desired effect.
By estimating the quartiles for weekly transfers and then plotting the box plots in a linear
graph, the manager can better predict bed usage, as seen in Figure 27.6.
On this graph, each bicolor bar depicts the first quartile, median, and third quartile for one
week. These are plotted on a weekly basis in a linear pattern that shows a slight increase.
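
The weekly summary behind such a graph can be sketched as follows. This is a minimal illustration under stated assumptions: the function name `weekly_quartiles` and the sample counts are hypothetical, weeks are taken as consecutive blocks of seven daily values, and the quartiles use the "inclusive" method of Python's `statistics.quantiles`. Other quartile conventions give slightly different values, and the text does not say which one was used for Figure 27.6.

```python
from statistics import quantiles
from typing import List, Tuple


def weekly_quartiles(daily: List[int], days_per_week: int = 7) -> List[Tuple[float, float, float]]:
    """Split daily counts into consecutive full weeks; return (Q1, median, Q3) per week."""
    result = []
    # Step through full weeks only; a trailing partial week is ignored.
    for start in range(0, len(daily) - days_per_week + 1, days_per_week):
        week = daily[start:start + days_per_week]
        q1, median, q3 = quantiles(week, n=4, method="inclusive")
        result.append((q1, median, q3))
    return result


# Hypothetical daily transfer counts: two full weeks plus one extra day.
daily_transfers = [1, 2, 3, 4, 5, 6, 7, 2, 3, 4, 5, 6, 7, 8, 9]
for week_number, (q1, median, q3) in enumerate(weekly_quartiles(daily_transfers), start=1):
    print(f"week {week_number}: Q1={q1} median={median} Q3={q3}")
```

Each returned triple supplies the bottom, dividing line, and top of one bicolor bar; plotting the medians in week order shows the trend.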
A word of caution: outliers. These are noticeably unusual values. A data point is considered
an outlier if it is more than 1.5 times the interquartile range away from the nearest quartile,
that is, below the first quartile or above the third quartile by more than that amount. The
reader must detect outliers, but there are no general rules on how outliers should be handled once
Table 27.4 Radiology Procedures

Procedure Number  Procedure Name             Total  Percentage  Cumulative %
1060              HEAD W/O CNTRST              750       24.1%          24.1
1005              ABD W/CNTRST                 492       15.8%          40.0
1225              PELVIS W/CNTRST              492       15.8%          55.8
2000              ABD 1 VIEW                   231        7.4%          63.3
1055              CHEST W/CNTRST               185        6.0%          69.2
1000              ABD W/O CNTRST               114        3.7%          72.9
1045              CHEST W/O CNTRST             114        3.7%          76.6
1410              MULTI-PLANAR REFORMATIONS    107        3.4%          80.0
1056              CHEST W/CNTRST EXT           105        3.4%          83.4
1002              RENAL STONE W/O CNTRST       102        3.3%          86.7
1215              PELVIS W/O CNTRST            102        3.3%          90.0
1080              HEAD W/&W/O CNTRST            42        1.4%          91.3
1085              MXFACE 1 PJ W/O CNTRST        39        1.3%          92.6
1025              BIOPSY 30-60 MINUTES          30        1.0%          93.5
1015              ABD W&W/O CNTRST              24        0.8%          94.3
1255              C SPINE W/O CNTRST            21        0.7%          95.0
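
The 1.5 times interquartile range rule given in the caution on outliers can be sketched in Python. This is a minimal illustration, not a prescribed procedure: the function name `iqr_outliers` and the sample values are hypothetical, and the quartiles are computed with the "inclusive" method of `statistics.quantiles`, which is only one of several common conventions.

```python
from statistics import quantiles
from typing import List


def iqr_outliers(values: List[float], k: float = 1.5) -> List[float]:
    """Return the values lying more than k * IQR below Q1 or above Q3."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1  # interquartile range
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical weekly transfer counts; 40 stands far apart from the rest.
print(iqr_outliers([10, 12, 11, 13, 12, 11, 40]))
```

The function only flags candidate values; deciding what to do with them remains a judgment call for the analyst.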