Discover
|
Analyze
|
Exploit
|
New applications
|
•Determine data storage requirements, including user data size and compression ratio.
•Determine high availability requirements.
•Determine customer corporate networking requirements, such as networking infrastructure and IP addressing.
•Determine whether data node OS disks require mirroring.
•Determine disaster recovery requirements, including backup/recovery and multisite disaster recover requirements.
•Determine cooling requirements, such as airflow and BTU requirements.
•Determine workload characteristics, such as MapReduce or HBase.
•Identify cluster management strategy, such as node firmware and OS updates.
•Identify a cluster rollout strategy, such as node hardware and software deployment.
|
•Propose InfoSphere BigInsights cluster as the solution to big data problems.
•Use the IBM System x M4 architecture for easy scalability of storage and memory.
|
Existing applications
|
•Determine data storage requirements and existing shortfalls.
•Determine memory requirements and existing shortfalls.
•Determine throughput requirements and existing bottlenecks.
•Identify system utilization inefficiencies.
|
•Propose a nondisruptive and lower risk solution.
•Propose a Proof-of-Concept (PoC) for the next server deployment.
•Propose a InfoSphere BigInsights cluster as a solution to big data problems.
•Use System x M4 architecture for easy scalability of storage and memory.
|
Data center health
|
•Determine server sprawl.
•Determine electrical, cooling, space headroom.
•Identify inefficiency concerns.
|
•Propose a scalable InfoSphere BigInsights cluster.
•Propose lowering data center costs with energy efficient System x servers.
|
Component
|
Predefined configuration
|
System
|
System x3550 M4
|
Processor
|
2 x E5-2650 v2 2.6 GHz 8-core
|
Memory - base
|
128 GB = 8 x 16GB 1866 MHz RDIMM
|
Disk (OS)
|
4 x 600 GB 2.5-inch SAS
|
HDD controller
|
ServeRAID M1115 SAS/SATA Controller
|
Hardware storage protection
|
•OS storage on 2 x 600 GB drives that are mirrored by using RAID 1 hardware mirroring.
•Application storage on 2 x 600 GB drives in JBOD or RAID 1 hardware mirroring configuration.
|
User space (per server)
|
None
|
Administration/management network adapter
|
Integrated 1GBaseT Adapter
|
Data network adapter
|
2 x Mellanox ConnectX-3 EN Dual-port SFP+ 10GbE Adapters
|
Environment
|
Required management nodes
|
Node 1
|
Node 2
|
Node 3
|
Node 4
|
Development Environment
|
1
|
NameNode1, JobTracker, BigInsights Console
|
N/A
|
N/A
|
N/A
|
Production/Test Environment
|
32
|
NameNode
|
JobTracker, Secondary NameNode
|
BigInsights Console
|
N/A
|
Production / Test Environment with Highly Available NameNode
|
4b
|
NameNode (Active or Standby)
|
NameNode (Active or Standby)
|
JobTracker
|
BigInsights Console
|
Component
|
Predefined configuration
|
System
|
System x3650 BD
|
Processor
|
2 x E5-2650 v2 2.6 GHz 8-core
|
Memory - base
|
64 GB = 8x 8 GB 1866 MHz RDIMM
|
Disk (OS)1
|
3 TB drives: 1 or 2 x 3 TB NL SATA 3.5-inch
4 TB drives: 1 or 2 x 4 TB NL SATA 3.5-inch
|
Disk (data)2
|
3 TB drives: 12 x 3 TB NL SATA 3.5-inch (36 TB total)
4 TB drives: 12 x 4 TB NL SATA 3.5-inch (48 TB total)
|
HDD controller
|
6 Gb JBOD Controller
|
Hardware storage protection
|
None (JBOD). By default, HDFS maintains a total of three copies of data that is stored within the cluster. The copies are distributed across data servers and racks for fault recovery.
|
Management network adapter
|
Integrated 1GBaseT Adapter
|
Data network adapter
|
Mellanox ConnectX-3 EN Dual-port SFP+ 10GbE Adapter
|
Rack switch
|
Predefined configuration
|
1GbE top of rack switch for administration/management network
(two physical links to each node: one link for in-band OS administration and one link for out-of-band IMM2 hardware management).1
|
IBM System Networking RackSwitch G8052
|
10GbE top of rack switch for data network (two physical 10GbE links to each node, aggregated).
|
IBM System Networking RackSwitch G8264
|
40GbE switch for interconnecting data network across multiple racks (40GbE links interconnecting each G8264 top of rack switches; link aggregation depends on the number of core switches and interconnect topology).2
|
IBM System Networking RackSwitch G8316
|
Important: The number of edge nodes and the edge node server physical attributes that are required depend on ingest volume and velocity. Because of physical space constraints within a rack, adding an edge node to a rack can displace a data node.
|
Rack configuration size
|
Number of data nodes1
|
Number of management nodes2
|
Starter rack
|
33
|
1, 3, or 4
|
Half rack
|
9
|
1, 3, or 4
|
Full rack with management nodes
|
184
|
1, 3, or 4
|
Full data node rack, No management nodes
|
20
|
0
|
Edge nodes: This calculation does not consider edge nodes. Based on the client’s choice of edge node, proportions can vary. Every two 1U edge nodes displace one data node, and every one 2U displaces one data node.
|
Component
|
Predefined configuration
|
System
|
x3550 M4
|
Processor
|
2 x E5-2650 v2 2.6 GHz 8 core
|
Memory - base
|
128 GB = 8 x 16 GB 1866 MHz RDIMM
|
Disk (OS)
|
4 x 600 GB 2.5-inch SAS
|
HDD controller
|
ServeRAID M1115 SAS/SATA Controller
|
Hardware storage protection
|
•OS storage on 2 x 600 GB drives that are mirrored by using hardware mirroring
•Application storage on 2 x 600 GB drives in JBOD configuration
|
User space (per server)
|
None
|
Administration/management network adapter
|
Integrated 1GBaseT Adapter
|
Data network adapter
|
2 x Mellanox ConnectX-3 EN Dual-port SFP+ 10 GbE Adapter
|
Cluster size
|
Required management nodes
|
Node 1
|
Node 2
|
Node 3
|
Node 4
|
Node 5
|
Node 6
|
Starter cluster
|
1
|
NameNode1, JobTracker,
HMaster, BigInsights Console
|
|
|
|
|
|
<20 data nodes
|
42
|
NameNode,
Zookeeper
|
JobTracker, HMaster, Zookeeper
|
Secondary NameNode, HMaster, Zookeeper
|
BigInsights Console
|
|
|
>= 20 data nodes
|
63
|
NameNode, Zookeeper
|
Secondary NameNode4, Zookeeper
|
JobTracker, Zookeeper
|
HMaster, Zookeeper
|
HMaster, Zookeeper
|
BigInsights Console
|
Component
|
Pre-defined configuration
|
System
|
x3650 BD
|
Processor
|
2 x E5-2650 v2 2.6 GHz 8 core
|
Memory - base
|
128 GB =16 x 8 GB 1866 MHz RDIMM
|
Disk (OS)1
|
1 TB drives: 1 or 2 x 1 TB NL SATA 3.5-inch
2 TB drives: 1 or 2 x 2 TB NL SATA 3.5-inch
|
1 TB drives: 12 x 1 TB NL SATA 3.5-inch (12 TB total)
2 TB drives: 12 x 2 TB NL SATA 3.5-inch (24 TB total)
|
|
HDD controller
|
6 Gb JBOD Controller
|
Hardware storage protection
|
None (JBOD). By default, HDFS maintains a total of three copies of data that is stored within the cluster. The copies are distributed across data servers and racks for fault recovery.
|
Administration/management network adapter
|
Integrated 1GBaseT Adapter
|
Data network adapter
|
Mellanox ConnectX-3 EN Dual-port SFP+ 10GbE
|
Important: If the system is initially implemented as a multirack solution or if the system grows by adding more racks, distribute the cluster management nodes across the racks to maximize fault tolerance.
|
Cluster size
|
Number of racks
|
Maximum number of data nodes per rack1
|
Number of management nodes per cluster
|
Starter rack
|
1
|
32
|
1
|
<20 data nodes
|
1 - 2
|
18
|
4
|
>= 20 data nodes
|
2+
|
18 - 203
|
6
|
Edge nodes: This calculation does not consider edge nodes. Based on the client’s choice of edge node, proportions can vary. Every two 1U edge nodes displace one data node, and every one 2U edge node displaces one data node.
|
|
Value options
|
Enterprise options
|
Performance options
|
Processor
|
2 x E5-2630 v2 2.2 GHz 6-core
|
2 x E5-2650 v2 2.6 GHz 8-core
|
2 x E5-2680 v2 2.8 GHz 10-core
|
Memory - base
|
48 GB = 6 x 8 GB
|
64 GB = 8 x 8 GB 1866 MHz
|
128 GB = 16 x 8 GB 1866 MHz1
|
Disk (data and OS)
|
MapReduce: 13 or 14 x 3 TB NL SATA 3.5-inch
HBase: 13 or 14 x 1 TB NL SATA 3.5-inch
|
MapReduce: 13 or 14 x 3 TB or 4 TB NL SATA 3.5-inch
HBase: 13 or 14 x 1 TB or 2 TB NL SATA 3.5-inch
|
MapReduce: 13 or 14 x 3 TB or 4 TB NL SATA 3.5-inch
HBase: 13 or 14 x 1 TB or 2 TB NL SATA 3.5-inch
|
HDD controller
|
6 Gb JBOD Controller
|
ServeRAID M1115 SAS/SATA Controller
|
6 Gb JBOD Controller
|
Hardware storage protection
|
None (JBOD)
|
RAID 5 11+P
RAID 6 10+P+Q (business critical)
|
None (JBOD)
|
Data network
|
1GbE switch with 4 x 10GbE uplinks (IBM G8052)
|
Redundant 10GbE switches with 4 x 40GbE Uplinks per switch (IBM G8264)
|
10GbE switch with 4 x 40GbE uplinks (IBM G8264)
|
|
Recommended Configuration
|
Server
|
System x3250
|
Processor
|
E3-1220 3.1GHz 4-core
|
Memory - base
|
16GB 1333 MHz
|
Disk
|
2 x 1 TB NL SATA
|
HDD controller
|
ServeRAID C100 for System x
|
Storage protection
|
Hardware mirroring (RAID1)
|
Network (Admin & IMM Networks)
|
Integrated 1Gb Ethernet ports1
|
Part number
|
Description
|
Quantity
|
5466
|
IBM System x3650 M4 BD
|
18
|
A4T7
|
PCIe Riser Card 2 (1 x8 LP for Slotless RAID)
|
18
|
A4T6
|
PCIe Riser Card for slot 1 (1 x8 FH/HL + 1 x8 LP Slots)
|
18
|
A3PM
|
Mellanox ConnectX-3 EN Dual-port SFP+ 10GbE Adapter
|
18
|
5977
|
Select Storage devices; RAID configured by IBM is not required
|
18
|
A22S
|
IBM 3TB 7.2K 6 Gbps NL SATA 3.5-inch G2HS HDD
|
252
|
A4RV
|
IBM System x 750W High Efficiency Platinum AC Power Supply
|
18
|
A4WC
|
System Documentation and Software, US English
|
18
|
A4S4
|
Intel Xeon Processor E5-2650 v2 8C 2.6 GHz 20 MB Cache 1866 MHz 95W
|
18
|
A4AS
|
Additional Intel Xeon Processor E5-2650 v2 8C 2.6 GHz 20 MB Cache 1866 MHz 95W
|
18
|
A3QG
|
8 GB (1x8 GB, 1Rx4, 1.5V) PC3-14900 CL13 ECC DDR3 1866 MHz LP RDIMM
|
108
|
A3YY
|
N2215 SAS/SATA HBA for IBM System x
|
18
|
A4RQ
|
System x3650 M4 BD Planar
|
18
|
A4RG
|
System x3650 M4 BD Chassis ASM without Planar
|
18
|
6311
|
2.8m, 10A/100-250V, C13 to IEC 320-C14 Rack Power Cable
|
18
|
A4RR
|
3.5-inch Hot Swap BP Bracket Assembly, 12x 3.5
|
18
|
A4RS
|
3.5-inch Hot Swap Cage Assembly, Rear, 2 x 3.5
|
18
|
2306
|
Rack Installation >1U Component
|
18
|
A4RH
|
BIOS GBM
|
18
|
A4RJ
|
L1 COPT, 1U RIASER CAGE - SLOT 2
|
18
|
A4RK
|
L1 COPT, 1U BUTTERFLY RIASER CAGE - SLOT 1
|
18
|
A4RN
|
x3650 M4 BD Agency Label
|
18
|
A4RP
|
Label GBM
|
18
|
A50F
|
2x2 HDD BRACKET
|
18
|
A207
|
Rail Kit for x3650 M4 BD, x3630 M4, and x3530 M4
|
18
|
A2M3
|
Shipping Bracket for x3650 M4 BD and 3630 M4
|
18
|
Part number
|
Description
|
Quantity
|
7914FT1
|
System x3550 M4
|
3
|
A1H3 ***
|
System x3550 M4 2.5-inch Base Without Power Supply
|
3
|
5977
|
Select Storage devices; RAID configured by IBM is not required
|
3
|
A1MZ
|
ServeRAID M1115 SAS/SATA Controller for System x
|
3
|
A2XD
|
IBM 600 GB 10K 6 Gbps SAS 2.5-inch SFF G2HS HDD
|
12
|
A228
|
IBM System x Gen-III Slides Kit
|
3
|
A229
|
IBM System x Gen-III CMA
|
3
|
A1HG
|
System x3550 M4 4x 2.5-inch HDD Assembly Kit
|
3
|
A1ML
|
IBM Integrated Management Module Advanced Upgrade
|
3
|
A1H5
|
System x 750W High Efficiency Platinum AC Power Supply
|
3
|
A1HL
|
System x3550 M4 PCIe Gen-III Riser Card 2 (1 x16 FH/HL Slot)
|
3
|
A2ZQ
|
Mellanox ConnectX-3 EN Dual-port SFP+ 10GbE Adapter
|
6
|
A1HJ
|
System x3550 M4 PCIe Riser Card 1 (1 x16 LP Slot)
|
3
|
A1HP ***
|
System Documentation and Software, US English
|
3
|
A3QL
|
16 GB (1 x16 GB, 2Rx4, 1.5V) PC3-14900 CL13 ECC DDR3 1866 MHz LP RDIMM
|
24
|
A1H5
|
System x 750W High Efficiency Platinum AC Power Supply
|
3
|
6263
|
4.3m, 10A/100-250V, C13 to IEC 320-C14 Rack Power Cable
|
6
|
A2U6
|
IBM System x Advanced Lightpath Kit
|
3
|
A3WR
|
Intel Xeon Processor E5-2650 v2 8C 2.6 GHz 20 MB Cache 1866 MHz 95W
|
3
|
A3X9
|
Additional Intel Xeon Processor E5-2650 v2 8C 2.6 GHz 20 MB Cache 1866 MHz 95W with Fan
|
3
|
2305
|
Rack Installation of 1U Component
|
3
|
A3XM
|
System x3550 M4 Planar
|
3
|
A1HB
|
System x3550 M4 System Level Code
|
3
|
A1HD
|
System x3550 M4 Agency Label GBM
|
3
|
Part number
|
Description
|
Quantity
|
7309HC1
|
IBM System Networking RackSwitch G8052 (Rear to Front)
|
1
|
6311
|
2.8m, 10A/100-250V, C13 to IEC 320-C14 Rack Power Cable
|
2
|
A1DK
|
IBM 19-inch Flexible 4 Post Rail Kit
|
1
|
2305
|
Rack Installation of 1U Component
|
1
|
Part number
|
Description
|
Quantity
|
7309HC3
|
IBM System Networking RackSwitch G8264 (Rear to Front)
|
1
|
6311
|
2.8m, 10A/100-250V, C13 to IEC 320-C14 Rack Power Cable
|
2
|
A1DK
|
IBM 19-inch Flexible 4 Post Rail Kit
|
1
|
2305
|
Rack Installation of 1U Component
|
1
|
Part number
|
Description
|
Quantity
|
1410RC4
|
e1350 42U rack cabinet
|
1
|
6012
|
DPI Single-phase 30A/208V C13 Enterprise PDU (US)
|
4
|
2202
|
Cluster 1350 Ship Group
|
1
|
2304
|
Rack Assembly - 42U Rack
|
1
|
2310
|
Cluster Hardware & Fabric Verification - 1st Rack
|
1
|
4271
|
1U black plastic filler panel
|
1
|
Part number
|
Description
|
Quantity
|
3735
|
0.5m Molex Direct Attach Copper SFP+ Cable
|
2
|
3736
|
1m Molex Direct Attach Copper SFP+ Cable
|
4
|
3737
|
3m Molex Direct Attach Copper SFP+ Cable
|
36
|
2323
|
IntraRack CAT5E Cable Service
|
42
|
Part number
|
Description
|
Quantity
|
5466
|
IBM System x3650 M4 BD
|
18
|
A4T7
|
PCIe Riser Card 2 (1 x8 LP for Slotless RAID)
|
18
|
A4T6
|
PCIe Riser Card for slot 1 (1 x8 FH/HL + 1 x8 LP Slots)
|
18
|
A3PM
|
Mellanox ConnectX-3 EN Dual-port SFP+ 10GbE Adapter
|
18
|
5977
|
Select Storage devices; RAID configured by IBM is not required
|
18
|
A22T
|
IBM 2TB 7.2K 6 Gbps NL SATA 3.5-inch G2HS HDD
|
252
|
A4RV
|
IBM System x 750W High Efficiency Platinum AC Power Supply
|
18
|
A4WC
|
System Documentation and Software, US English
|
18
|
A4S4
|
Intel Xeon Processor E5-2650 v2 8C 2.6 GHz 20 MB Cache 1866 MHz 95W
|
18
|
A4AS
|
Additional Intel Xeon Processor E5-2650 v2 8C 2.6 GHz 20 MB Cache 1866 MHz 95W
|
18
|
A3QG
|
8 GB (1x8 GB, 1Rx4, 1.5V) PC3-14900 CL13 ECC DDR3 1866 MHz LP RDIMM
|
108
|
A3YY
|
N2215 SAS/SATA HBA for IBM System x
|
18
|
A4RQ
|
System x3650 M4 BD Planar
|
18
|
A4RG
|
System x3650 M4 BD Chassis ASM without Planar
|
18
|
6311
|
2.8m, 10A/100-250V, C13 to IEC 320-C14 Rack Power Cable
|
18
|
A4RR
|
3.5-inch Hot Swap BP Bracket Assembly, 12x 3.5
|
18
|
A4RS
|
3.5-inch Hot Swap Cage Assembly, Rear, 2 x 3.5
|
18
|
2306
|
Rack Installation >1U Component
|
18
|
A4RH
|
BIOS GBM
|
18
|
A4RJ
|
L1 COPT, 1U RIASER CAGE - SLOT 2
|
18
|
A4RK
|
L1 COPT, 1U BUTTERFLY RIASER CAGE - SLOT 1
|
18
|
A4RN
|
x3650 M4 BD Agency Label
|
18
|
A4RP
|
Label GBM
|
18
|
A50F
|
2x2 HDD BRACKET
|
18
|
A207
|
Rail Kit for x3650 M4 BD, x3630 M4, and x3530 M4
|
18
|
A2M3
|
Shipping Bracket for x3650 M4 BD and 3630 M4
|
18
|
Part number
|
Description
|
Quantity
|
7914FT1
|
System x3550 M4
|
4
|
A1H3 ***
|
System x3550 M4 2.5-inch Base Without Power Supply
|
4
|
5977
|
Select Storage devices; RAID configured by IBM is not required
|
4
|
A1MZ
|
ServeRAID M1115 SAS/SATA Controller for System x
|
4
|
A2XD
|
IBM 600 GB 10K 6 Gbps SAS 2.5-inch SFF G2HS HDD
|
16
|
A228
|
IBM System x Gen-III Slides Kit
|
4
|
A229
|
IBM System x Gen-III CMA
|
4
|
A1HG
|
System x3550 M4 4x 2.5-inch HDD Assembly Kit
|
4
|
A1ML
|
IBM Integrated Management Module Advanced Upgrade
|
4
|
A1H5
|
System x 750W High Efficiency Platinum AC Power Supply
|
4
|
A1HL
|
System x3550 M4 PCIe Gen-III Riser Card 2 (1 x16 FH/HL Slot)
|
4
|
A2ZQ
|
Mellanox ConnectX-3 EN Dual-port SFP+ 10GbE Adapter
|
8
|
A1HJ
|
System x3550 M4 PCIe Riser Card 1 (1 x16 LP Slot)
|
4
|
A1HP ***
|
System Documentation and Software, US English
|
4
|
A3QL
|
16 GB (1 x16 GB, 2Rx4, 1.5V) PC3-14900 CL13 ECC DDR3 1866 MHz LP RDIMM
|
32
|
A1H5
|
System x 750W High Efficiency Platinum AC Power Supply
|
4
|
6263
|
4.3m, 10A/100-250V, C13 to IEC 320-C14 Rack Power Cable
|
8
|
A2U6
|
IBM System x Advanced Lightpath Kit
|
4
|
A3WR
|
Intel Xeon Processor E5-2650 v2 8C 2.6 GHz 20 MB Cache 1866 MHz 95W
|
4
|
A3X9
|
Additional Intel Xeon Processor E5-2650 v2 8C 2.6 GHz 20 MB Cache 1866 MHz 95W with Fan
|
4
|
2305
|
Rack Installation of 1U Component
|
4
|
A3XM
|
System x3550 M4 Planar
|
4
|
A1HB
|
System x3550 M4 System Level Code
|
4
|
A1HD
|
System x3550 M4 Agency Label GBM
|
4
|
Part number
|
Description
|
Quantity
|
7309HC1
|
IBM System Networking RackSwitch G8052 (Rear to Front)
|
1
|
6311
|
2.8m, 10A/100-250V, C13 to IEC 320-C14 Rack Power Cable
|
2
|
A1DK
|
IBM 19-inch Flexible 4 Post Rail Kit
|
1
|
2305
|
Rack Installation of 1U Component
|
1
|
Part number
|
Description
|
Quantity
|
7309HC3
|
IBM System Networking RackSwitch G8264 (Rear to Front)
|
1
|
6311
|
2.8m, 10A/100-250V, C13 to IEC 320-C14 Rack Power Cable
|
2
|
A1DK
|
IBM 19-inch Flexible 4 Post Rail Kit
|
1
|
2305
|
Rack Installation of 1U Component
|
1
|
Part number
|
Description
|
Quantity
|
1410RC4
|
e1350 42U rack cabinet
|
1
|
6012
|
DPI Single-phase 30A/208V C13 Enterprise PDU (US)
|
4
|
2202
|
Cluster 1350 Ship Group
|
1
|
2304
|
Rack Assembly - 42U Rack
|
1
|
2310
|
Cluster Hardware and Fabric Verification - 1st Rack
|
1
|
Part number
|
Description
|
Quantity
|
3735
|
0.5m Molex Direct Attach Copper SFP+ Cable
|
4
|
3736
|
1m Molex Direct Attach Copper SFP+ Cable
|
4
|
3737
|
3m Molex Direct Attach Copper SFP+ Cable
|
36
|
2323
|
IntraRack CAT5E Cable Service
|
44
|