IBM BS029ML Self Help Guide - Page 42

Two sets of production environments, The Flip-Flop approach

Page 42 highlights

undertaken during a period of scheduled outage, such as during the weekend or overnight, when the respective users of the solution may be unaffected by any downtime. Alternatively, in-situ maintenance can be performed by adhering to the IBM documented 24x7 maintenance procedure. However, while this latter approach represents a distinct improvement over the 24x7 maintenance procedures applicable to previous versions of WebSphere Portal Server, the complexities of performing such maintenance, and maintaining an uninterrupted level of service, arguably remain high risk for many organizations. As such, the decision to implement the 24x7 maintenance procedure in a single clustered WebSphere Portal Server V6.0.x environment can only rest with the comfort factor required by each particular organization. The key differentiator between this and the other maintenance methods described in this section is that the in-situ procedure does not require an additional environment during the period of maintenance. Full details for this procedure, including the IBM documented 24x7 maintenance proceedure, can be found in the WebSphere Portal Server Version 6.0 Information Center. 2.4.2 Two sets of production environments This option considers two sets of production environments, each supporting duplicate WebSphere Portal Server configurations. It is not implied that either environment shares the same user community, customization, and wmm database domains in a peer-to-peer fashion, as is now achieveable with WebSphere Portal Server V6.0.x. Instead, it is characterized by the replication of all data and artifacts between hosting environments prior to undertaking maintenance. As such, the deployment of two sets of production environments has long been an acknowledged approach for maintaining operational continuity and for addressing disaster recovery. However, unlike the approach detailed in 2.4.3, "The dual cluster with two lines of production architecture" on page 29, such a deployment does not normally see both sets of production servers actively handling the load. It is what is commonly referred to as an active/passive implementation. Furthermore, the passive environment may also serve the purpose of addressing disaster recovery. When considering an architecture based on the selection of two sets of production environments, there are two sub approaches that are most commonly implemented. The Flip-Flop approach One approach for maintaining a continuous level of operation in a WebSphere Portal Server environment has been with the adoption of a Flip-Flop architecture. Such an architecture utilizes a second WebSphere Portal Server instance identical to the primary instance. Each environment is normally clustered and highly available in its own right. During scheduled maintenance, user requests are first gracefully quiesced over to the secondary instance, or Flip environment, before Fix Packs and updates are applied to primary instance. With the successful completion of all maintenance activities to the primary instance, the decision can be made to Flop the users back or to retain the users against the current secondary instance. If the latter option is selected, the secondary WebSphere Portal Server environment effectively becomes the primary instance. Unfortunately, there are several weaknesses associated with the Flip-Flop architecture that make the approach somewhat less than straight forward to implement. First, it should be fairly evident that the procedure requires an additional physical environment. Typically, many organizations address this issue by collocating the secondary instance, or Flip environment, on the same hardware hosting their staging environment. Secondly, there is the requirement to transfer all configurational data, including the very Portlets deployed within the Portal, and the need to carry across any user customization data between Portal instances. With the 28 IBM WebSphere Portal V6 Self Help Guide

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212
  • 213
  • 214
  • 215
  • 216
  • 217
  • 218
  • 219
  • 220
  • 221
  • 222
  • 223
  • 224
  • 225
  • 226
  • 227
  • 228
  • 229
  • 230
  • 231
  • 232
  • 233
  • 234
  • 235
  • 236
  • 237
  • 238
  • 239
  • 240
  • 241
  • 242

28
IBM WebSphere Portal V6 Self Help Guide
undertaken during a period of scheduled outage, such as during the weekend or overnight,
when the respective users of the solution may be unaffected by any downtime. Alternatively,
in-situ maintenance can be performed by adhering to the IBM documented 24x7 maintenance
procedure. However, while this latter approach represents a distinct improvement over the
24x7 maintenance procedures applicable to previous versions of WebSphere Portal Server,
the complexities of performing such maintenance, and maintaining an uninterrupted level of
service, arguably remain high risk for many organizations. As such, the decision to implement
the 24x7 maintenance procedure in a single clustered WebSphere Portal Server V6.0.x
environment can only rest with the comfort factor required by each particular organization.
The key differentiator between this and the other maintenance methods described in this
section is that the in-situ procedure does not require an additional environment during the
period of maintenance. Full details for this procedure, including the IBM documented 24x7
maintenance proceedure, can be found in the WebSphere Portal Server Version 6.0
Information Center.
2.4.2
Two sets of production environments
This option considers two sets of production environments, each supporting duplicate
WebSphere Portal Server configurations. It is not implied that either environment shares the
same user community, customization, and wmm database domains in a peer-to-peer fashion,
as is now achieveable with WebSphere Portal Server V6.0.x. Instead, it is characterized by
the replication of all data and artifacts between hosting environments prior to undertaking
maintenance.
As such, the deployment of two sets of production environments has long been an
acknowledged approach for maintaining operational continuity and for addressing disaster
recovery. However, unlike the approach detailed in 2.4.3, “The dual cluster with two lines of
production architecture” on page 29, such a deployment does not normally see both sets of
production servers actively handling the load. It is what is commonly referred to as an
active/passive implementation. Furthermore, the passive environment may also serve the
purpose of addressing disaster recovery.
When considering an architecture based on the selection of two sets of production
environments, there are two sub approaches that are most commonly implemented.
The Flip-Flop approach
One approach for maintaining a continuous level of operation in a WebSphere Portal Server
environment has been with the adoption of a Flip-Flop architecture. Such an architecture
utilizes a second WebSphere Portal Server instance identical to the primary instance. Each
environment is normally clustered and highly available in its own right. During scheduled
maintenance, user requests are first gracefully quiesced over to the secondary instance, or
Flip environment, before Fix Packs and updates are applied to primary instance. With the
successful completion of all maintenance activities to the primary instance, the decision can
be made to Flop the users back or to retain the users against the current secondary instance.
If the latter option is selected, the secondary WebSphere Portal Server environment
effectively becomes the primary instance.
Unfortunately, there are several weaknesses associated with the Flip-Flop architecture that
make the approach somewhat less than straight forward to implement. First, it should be fairly
evident that the procedure requires an additional physical environment. Typically, many
organizations address this issue by collocating the secondary instance, or Flip environment,
on the same hardware hosting their staging environment. Secondly, there is the requirement
to transfer all configurational data, including the very Portlets deployed within the Portal, and
the need to carry across any user customization data between Portal instances. With the