HP Cluster Platform Interconnects v2010 Quadrics QsNetII Interconnect - Page 111

Using qselantest to Verify the QM500 Network Adapter

Page 111 highlights

12.2.4 Verifying the View of the Network To ensure that there is a correct and consistent view of the interconnect network, you must verify the following: • That the QM500 PCI adapters in each node can see the correct network size. • Each node has the correct network position. To verify the network, look in the position file and obtain the following data values: • NodeId - The node identifier, in the format nNr0, where: n = Node N - An unique integer representing the node's position in the network. This number is in the range 0 - NumNodes r0 - Rail 0 (HP Cluster Platform uses only a single-rail configuration). • NumLevels - An integer representing the number of levels in the interconnect hierarchy. • NumNodes - An integer representing the total number of nodes in the cluster. Use the following procedure to examine the position file: 1. Log in to any application node in the cluster and enter the following command: # cat /proc/qsnet/elan4/device0/position 2. The following information is displayed: NodeId 0 NumLevels 2 NumNodes 16 This data indicates that the node's position is 0 (the first application node in the network) and it can see a 2-level network that has a total of 16 nodes forming the cluster. You can repeat this step for each node in the cluster, as necessary. 3. Log in to the administrative node and enter the following command: # /cluster/bin/crsh -r all "nodename; cat /proc/qsnet/elan4/device0/position" 4. The following sample output is from a cluster connected to a 64 way interconnect. The output is truncated for ease of reference: admin-0 NodeId 0 NumLevels 2 NodeId 1 NumLevels 2 NodeId 2 NumLevels 2 NumNodes 16 NumNodes 16 NumNodes 16 n1r0 n2r0 The output shows the name of the administrative node, admin-0, and displays the position data for each application node enabling you to verify that the entire network is visible. 12.3 Using qselantest to Verify the QM500 Network Adapter The qselantest command tests for the presence of a QM500 PCI adapter in the node. Use It to verify the function of a QM500 network adapter in a specific node and its ability to access the interconnect network. The following characteristics of the QM500 PCI adapter are checked: • Clock speed. • Thread processor. • SDRAM memory on the QM500 . Maintenance and Diagnostic Procedures 12-9

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166

12.2.4 Verifying the View of the Network
To ensure that there is a correct and consistent view of the interconnect network,
you must verify the following:
That the QM500 PCI adapters in each node can see the correct network size.
Each node has the correct network position.
To verify the network, look in the
position
file and obtain the following data
values:
NodeId – The node identifier, in the format
n
N
r0
, where:
n = Node
N
- An unique integer representing the node’s position in the network. This
number is in the range 0 -
NumNodes
r0
- Rail 0 (HP Cluster Platform uses only a single-rail configuration).
NumLevels – An integer representing the number of levels in the interconnect
hierarchy.
NumNodes – An integer representing the total number of nodes in the cluster.
Use the following procedure to examine the
position
file:
1.
Log in to any application node in the cluster and enter the following command:
#
cat /proc/qsnet/elan4/device0/position
2.
The following information is displayed:
NodeId 0
NumLevels 2
NumNodes 16
This data indicates that the node’s position is 0 (the first application node in
the network) and it can see a 2-level network that has a total of 16 nodes
forming the cluster. You can repeat this step foreach node in the cluster,
as necessary.
3.
Log in to the administrative node and enter the following command:
#
/cluster/bin/crsh -r all "nodename;
cat /proc/qsnet/elan4/device0/position"
4.
The following sample output is from a cluster connected to a 64 way
interconnect. The output is truncated for ease of reference:
admin-0
NodeId 0 NumLevels 2 NumNodes 16 n1r0
NodeId 1 NumLevels 2 NumNodes 16 n2r0
NodeId 2 NumLevels 2 NumNodes 16
<truncated>
The output shows the name of the administrative node,
admin-0
, and displays
the position data for each application node enabling you to verify that the
entire network is visible.
12.3 Using qselantest to Verify the QM500 Network Adapter
The
qselantest
command tests for the presence of a QM500 PCI adapter in the
node. Use It to verify the function of a QM500 network adapter in a specific node
and its ability to access the interconnect network. The following characteristics of
the QM500 PCI adapter are checked:
Clock speed.
Thread processor.
SDRAM memory on the QM500 .
Maintenance and Diagnostic Procedures
12-9