IBM E027SLL-H Troubleshooting Guide - Page 180

HUB Tivoli Enterprise Monitoring Server quiesce prevents

Page 180 highlights

Crash on Linux remote monitoring server during hub failover to Hot Standby The Tivoli Enterprise Monitoring Server can use a large number of file descriptors, especially in a large environment. On UNIX and Linux systems, the maximum number of file descriptors available to a process is controlled by user limit parameters. To display the user limits, run the following command: ulimit -a The "nofiles" parameter is the number of file descriptors available to a process. For the monitoring server process (kdsmain), the "nofiles" parameter should be set larger than the maximum number of agents that will be connecting to the monitoring server. If the monitoring server is unable to get file descriptors when needed, unexpected behavior can occur, including program failures. Consider increasing the value to 1000 file descriptors or more. There are other user limit parameters that control how much data, stack and memory are available to a process. For large environments, consider increasing these memory-related user limit parameters for the monitoring server (kdsmain) process. Configuring the user limit parameters usually requires root access, and involves changing system startup files which are operating system specific. Consult the operating system manuals for information on how to configure the user limit parameters. HUB Tivoli Enterprise Monitoring Server quiesce prevents the display of the data collected by the attached Tivoli Enterprise Monitoring Agents A HUB Tivoli Enterprise Monitoring Server has been running. A shutdown of the Tivoli Enterprise Monitoring Server and Tivoli Enterprise Monitoring Agents on the remote systems is in process, but the shutdown takes awhile due to abends in the remote Tivoli Enterprise Monitoring Server. About 8 or 9 minutes go by before the HUB Tivoli Enterprise Monitoring Server quiesces. There are a lot of remote request communication messages in the HUB's RKLVLOG prior to the QUIESCE, but no other signs of errors until after the abend. You cannot restart the remote environments following the quiesce, until after the HUB environment is recycled. The value of the MINIMUM parameter within the KDSSYSIN member of the RKANPARU library might need to be increased if the STGDEBUG(X) or STGDEBUG(Y) parameter is also supplied within KDSSYSIN. If the address space controlled by this KDSSYSIN member enters a "storage quiesce" state (indicated by a KLVxxxxx message stating that there is a storage shortage or quiesce in effect), you should increase the value of the MINIMUM parameter and restart the address space. During installation of a remote Tivoli Enterprise Monitoring Server on a Windows system, the agent support is applied, but fails In a multiple-level Tivoli Enterprise Monitoring Server environment, the remote Tivoli Enterprise Monitoring Servers obtain their agent support from the hub Tivoli Enterprise Monitoring Server. In order to apply agent support to a remote Tivoli Enterprise Monitoring Server, the hub Tivoli Enterprise Monitoring Server must be running and reachable by the remote Tivoli Enterprise Monitoring Server. 162 IBM Tivoli Monitoring: Troubleshooting Guide

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212
  • 213
  • 214
  • 215
  • 216
  • 217
  • 218
  • 219
  • 220
  • 221
  • 222
  • 223
  • 224
  • 225
  • 226
  • 227
  • 228
  • 229
  • 230
  • 231
  • 232
  • 233
  • 234
  • 235
  • 236
  • 237
  • 238
  • 239
  • 240
  • 241
  • 242
  • 243
  • 244
  • 245
  • 246
  • 247
  • 248
  • 249
  • 250
  • 251
  • 252
  • 253
  • 254
  • 255
  • 256
  • 257
  • 258
  • 259
  • 260
  • 261
  • 262
  • 263
  • 264
  • 265
  • 266
  • 267
  • 268
  • 269
  • 270
  • 271
  • 272
  • 273
  • 274
  • 275
  • 276
  • 277
  • 278
  • 279
  • 280
  • 281
  • 282
  • 283
  • 284
  • 285
  • 286
  • 287
  • 288
  • 289
  • 290
  • 291
  • 292
  • 293
  • 294
  • 295
  • 296
  • 297
  • 298
  • 299
  • 300
  • 301
  • 302
  • 303
  • 304
  • 305
  • 306
  • 307
  • 308
  • 309
  • 310

Crash on Linux remote monitoring server during hub failover
to Hot Standby
The Tivoli Enterprise Monitoring Server can use a large number of file descriptors,
especially in a large environment. On UNIX and Linux systems, the maximum
number of file descriptors available to a process is controlled by user limit
parameters. To display the user limits, run the following command:
ulimit -a
The "nofiles" parameter is the number of file descriptors available to a process. For
the monitoring server process (kdsmain), the "nofiles" parameter should be set
larger than the maximum number of agents that will be connecting to the
monitoring server. If the monitoring server is unable to get file descriptors when
needed, unexpected behavior can occur, including program failures. Consider
increasing the value to 1000 file descriptors or more.
There are other user limit parameters that control how much data, stack and
memory are available to a process. For large environments, consider increasing
these memory-related user limit parameters for the monitoring server (kdsmain)
process.
Configuring the user limit parameters usually requires root access, and involves
changing system startup files which are operating system specific. Consult the
operating system manuals for information on how to configure the user limit
parameters.
HUB Tivoli Enterprise Monitoring Server quiesce prevents the
display of the data collected by the attached Tivoli Enterprise
Monitoring Agents
A HUB Tivoli Enterprise Monitoring Server has been running. A shutdown of the
Tivoli Enterprise Monitoring Server and Tivoli Enterprise Monitoring Agents on
the remote systems is in process, but the shutdown takes awhile due to abends in
the remote Tivoli Enterprise Monitoring Server. About 8 or 9 minutes go by before
the HUB Tivoli Enterprise Monitoring Server quiesces. There are a lot of remote
request communication messages in the HUB's RKLVLOG prior to the QUIESCE,
but no other signs of errors until after the abend. You cannot restart the remote
environments following the quiesce, until after the HUB environment is recycled.
The value of the MINIMUM parameter within the KDSSYSIN member of the
RKANPARU library might need to be increased if the STGDEBUG(X) or
STGDEBUG(Y) parameter is also supplied within KDSSYSIN. If the address space
controlled by this KDSSYSIN member enters a "storage quiesce" state (indicated by
a KLVxxxxx message stating that there is a storage shortage or quiesce in effect),
you should increase the value of the MINIMUM parameter and restart the address
space.
During installation of a remote Tivoli Enterprise Monitoring
Server on a Windows system, the agent support is applied,
but fails
In a multiple-level Tivoli Enterprise Monitoring Server environment, the remote
Tivoli Enterprise Monitoring Servers obtain their agent support from the hub Tivoli
Enterprise Monitoring Server. In order to apply agent support to a remote Tivoli
Enterprise Monitoring Server, the hub Tivoli Enterprise Monitoring Server must be
running and reachable by the remote Tivoli Enterprise Monitoring Server.
162
IBM Tivoli Monitoring: Troubleshooting Guide