EMC VNX Procedure Generator: Topic Selections
EMC VNX Procedure Generator: Topic Selections
Topic
VNX5300 Procedures
Selections
VNX5300 Activity Type: Hardware Replacements
VNX5300 File/Unified HW Replacements: Data Mover Enclosure (DME)
IMPORTANT: To be sure you have the latest SolVe generator application and content, always choose Yes
when asked if you want to update the application or repository.
REPORT PROBLEMS
If you find any errors in this procedure or have comments regarding this application, send email to
[email protected]
©
Copyright 2010 –2013 EMC Corporation. All rights reserved.
Publication Date: July, 2013
EMC believes the information in this publication is accurate as of its publication date. The information is subject to
change without notice.
THE INFORMATION IN THIS PUBLICATION IS PROVIDED “AS IS.” EMC CORPORATION MAKES NO
REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS
PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS
FOR A PARTICULAR PURPOSE.
Use, copying, and distribution of any EMC software described in this publication requires an applicable software
license.
For the most up-to-date regulatory document for your product line, go to the Technical Documentation and Advisories
section on EMC Powerlink.
For the most up-to-date listing of EMC trademarks, see the list of EMC Corporation Trademarks on EMC.com.
All other trademarks used herein are the property of their respective owners.
Contents
Handling Field Replaceable Units (FRUs)......................................................................... 4
FRUs and Power Issues .......................................................................................................................... 4
CAUTION: When you want to power down a storage processor enclosure (SPE), turn off the power
switch on the enclosure’s standby power supply (SPS). Never turn off the SPE’s power supply
switches. Turning off the SPE power supply switches instead of the SPS power switch shuts down
the storage processors before they have enough time to save the write-cache data to the vault
drives. This causes data loss.
If write-cache data is not saved, the storage processor log displays an error message similar to the
following:
Enclosure 0 Disk 5 0x90a (Can’t Assign - Cache Dirty) 0 0xafb40 0x14362c.
If you see this error message, contact Customer Service immediately.
To use it, attach the clip of the ESD wristband (strap) to any bare (unpainted) metal on the cabinet
enclosure; then secure the wristband around your wrist with the metal button against your skin.
IMPORTANT: These procedures are not a substitute for the use of an ESD kit. You should follow them
only in the event of an emergency.
Before touching any FRU, touch a bare (unpainted) metal surface of the enclosure.
Before removing any FRU from its antistatic bag, place one hand firmly on a bare metal surface of the
enclosure, and at the same time, pick up the FRU while it is still sealed in the antistatic bag. Once you
have done this, do not move around the room or contact other furnishings, personnel, or surfaces until
you have installed the FRU.
When you remove a FRU from the antistatic bag, avoid touching any electronic components and
circuits on it.
If you must move around the room or touch other surfaces before installing a FRU, first place the FRU
back in the antistatic bag. When you are ready again to install the FRU, repeat these procedures.
When installing multiple disks in a powered-up system, wait at least 6 seconds before sliding the
second and each subsequent disk into the system.
Place a disk module on a soft, antistatic surface, such as an industry-standard antistatic foam pad or
the container used to ship the module. Never place a disk module directly on a hard surface.
WARNING: The weight of some FRU assemblies is not evenly distributed. To prevent personal
injury or equipment damage when handling FRU assemblies, make sure you provide extra support
at the heavy end of a FRU assembly.
CAUTION: Electrostatic discharge (ESD) can damage equipment and impair electrical circuitry.
Always follow ESD prevention procedures when removing and replacing Customer-Replaceable
Units (CRUs) and Field-Replaceable Units (FRUs).
Task Summary
This procedure assumes that you are reinstalling (transferring) the power/cooling modules, management
modules, Blades, and I/O modules from the faulted Blade enclosure to the replacement Blade enclosure.
So that you can understand what is involved, the table of Contents lists the high-level tasks required for this
procedure. Take a moment to become familiar with these tasks. If you have any questions, contact
Customer Support before you begin the procedure.
CAUTION: This procedure takes approximately three hours to complete. During this time, the NAS
service is stopped and service is disrupted. Any process initiated by the Control Station (CS) or
any commands run from the /nas directory is inhibited. You cannot manage the system during this
time.
Note: This procedure shows command output collected from a two X-Blade (Blade) system, unless
otherwise specified. Be aware that the actual command output will differ depending on the number of
Blades in the system. Additionally, output values displayed in the document are for the specific system used
to create this procedure.
WARNING: The weight of some FRU assemblies is not evenly distributed. To prevent personal
injury or equipment damage when handling an assembly, make sure you provide extra support at
the heavy end of the assembly.
where <control_station> is the hostname or IP address of the primary Control Station (CS).
Note: Alerts will not automatically clear. Delete old alerts by right-clicking on the alert and selecting
Delete.
2. [ ] Use the drop-down list at the top left of the Dashboard to select the system that contains faulted
hardware.
3. [ ] Select System > Hardware > Hardware for File to view information about the hardware
components.
4. [ ] Check the system inventory for faulted hardware components.
5. [ ] Record the full component name for any faulted hardware found on the Hardware for File page.
The component name contains important information about hardware location.
For example, a Blade enclosure is listed in Unisphere as DME 0, where 0 is the Blade enclosure (DME)
number.
AC AC AC AC
CNS-001667
10. [ ] If a Blade enclosure LED or Unisphere show a faulted Blade enclosure, continue with this
replacement procedure.
11. [ ] If each Blade enclosure LED or Unisphere does not show a faulted Blade enclosure, it does not
need to be replaced.
Recheck the Unisphere to see if another component has faulted. If the Unisphere shows another
component has faulted, it might need to be replaced instead of a Blade enclosure.
14. [ ] Verify that the email notification service has stopped (is not enabled):
/nas/bin/nas_emailuser –info
15. [ ] ConnectHome and email notifications are now disabled on your VNX OE File system. Skip over the
next Block task to begin the service activity.
0 2 and 3
1 4 and 5
2 6 and 7
3 8 and 9
2. [ ] Within this file, find and record, in Table 2, the values for the Resume PROM information contained
in the RESUME_INFORMATION_MIDPLANE section.
Example XML file:
EMC_BARE_ASSEMBLY_PART_NUMBER="100-562-440 "
EMC_BARE_ASSEMBLY_REVISION="A03"
EMC_BARE_ASSEMBLY_SERIAL_NUMBER="FCNTR082900499 "
EMC_PART_NUMBER=" "
EMC_ARTWORK_REVISION=" "
EMC_ASSEMBLY_REVISION=" "
EMC_SERIAL_NUMBER=" "
EMC_PRODUCT_PART_NUMBER="900-567-002 "
EMC_PRODUCT_SERIAL_NUMBER="FCNCH095103338 "
VENDOR_NAME="Blank "
LOCATION_OF_MANUFACTURE="Blank "
YEAR_OF_MANUFACTURE=" "
MONTH_OF_MANUFACTURE=" "
DAY_OF_MONTH_OF_MANUFACTURE=" "
ASSEMBLY_NAME=" "
/>
EMC_PRODUCT_PART_NUMBER PRODUCT PN
EMC_PRODUCT_SERIAL_NUMBER PRODUCT SN
ASSEMBLY_NAME ASSEMBLY_NAME
Note 1: All of this information except the ASSEMBLY_NAME should also be on the serial number tag located on the rear of the BE
enclosure.
Note 2: The field names listed in Corresponding Resume PROM Field correspond to the value prompts found in Configure the
BE Resume PROM.
Note: The CE should contact support and get the Resume PROM configuration data from CSI if not
available locally.
3. [ ] The Resume PROM values for the faulted BE have been collected.
Note: If the secondary CS is currently acting as the primary CS, you may need to failback and then
continue to power down the secondary CS.
1. [ ] At the rear of the cabinet, connect the null modem cable from a serial port on your management
console to the secondary Control Station, CS 1.
2. [ ] Open the HyperTerminal or PuTTY session using the following settings:
Bits per second: 19200
Data bits: 8
Parity: None
Stop bits: 1
Flow control: None
Emulation: Auto Detect
Telnet terminal ID: ANSI
3. [ ] Login in to CS 1 as nasadmin.
4. [ ] Change to the root user by entering the following command:
$ su root
Note: This may take a few minutes to complete. Ensure it is finished before you continue.
6. [ ] Press and hold the power button to power down CS 1. The Control Station power button is shown
in Figure 3. Remove the bezel to access this button.
7. [ ] Wait 30 seconds for the internal disk drive to stop spinning and safely park its heads before
continuing.
8. [ ] Disconnect the CS 1 power cable from the power distribution panel (PDP).
9. [ ] Close the HyperTerminal or PuTTY session with CS 1.
10. [ ] Reconnect your management console to the serial port on CS 0 and reopen the HyperTerminal or
PuTTY session on CS 0 using the settings found in Step 2.
11. [ ] The secondary CS is halted and powered off.
Note: This document uses the term Blade to refer to a component that is defined as a logical construct of a
CPU module with its associated I/O modules.
Single-Blade system
For a single-Blade system, prepare the Blade for removal by doing the following:
1. [ ] Unmount the file systems and prepare the Blade to be halted by typing:
# /nas/bin/server_cpu server_2 -q now
CAUTION: Stopping the NAS service disrupts service on the system. Any process initiated
by the Control Station or any commands run from the /nas directory are inhibited. Do not to
attempt to run this command while working in the /nas directory.
This command takes up to five minutes to complete. If this command fails, reboot the CS and try
again.
3. [ ] Halt the Blade by typing:
CAUTION: When you shut down the Blade, you will lose access to the public networks and file
systems. Verify that the customer is prepared for this action.
# /nasmcd/sbin/t2reset pwroff -s 2
4. [ ] Wait three minutes for the previous command to complete and then verify that the Blade has
powered off:
Note: The getreason command can be augmented to provide continuous monitoring of the system
reason codes by adding the watch command. Use Ctrl/C to stop end the monitoring. For example # watch
/nasmcd/sbin/getreason
Example Output:
# /nasmcd/sbin/getreason
6 - slot_0 primary control station
- slot_2 powered off
6. [ ] The BE is removed from the NAS database and the NAS service is stopped.
Disconnect the faulted Blade enclosure's two power cables from the rack.
Multi-Blade system
For a multi-Blade system, prepare the Blade enclosure for removal by doing the following:
1. [ ] Determine the failover status of the Blades by entering:
# /nas/bin/nas_server -info –all
Sample Output:
id = 1
name = server_2.faulted.server_3
acl = 0
type = standby
slot = 2
member_of =
standbyfor= server_2
status :
defined = out_of_service
actual = online, ready
id = 2
name = server_2
acl = 0
type = nas
slot = 3
member_of =
standby = server_2.faulted.server_3, policy=manual
status :
defined = enabled
actual = online, ready
Value Definition
id ID of the Blade.
name Name given to the Blade.
acl Access control level value assigned to the Blade or VDM.
type Type assigned to the Blade. (nas=primary; standby=standby)
slot Physical slot in the system where the Blade resides.
member_of Group to which the Blade is a member.
standby If the Blade has a local standby associated with it.
standbyfor If the Blade has a local standby, lists associated primary.
status Whether the Blade is enabled or disabled, and whether it is active.
2. [ ] Go to Step 4 if the system is configured with only one Blade enclosure, otherwise continue.
3. [ ] For a multi-Blade enclosure system, prepare each Blade in the faulted BE for removal. Repeat this
step for each Blade in the faulted enclosure before continuing to the next step.
a. If the Blade is a primary Blade and has no standby Blade configured, no action is required.
b. If the Blade is a primary Blade that has a standby Blade configured in another BE, failover to that
standby Blade:
# /nas/bin/server_standby <server_name> -activate mover
Where <server_name> is the name of the server. Use the nas_server –list command to get
the server name.
c. If the Blade is configured as a standby Blade and is currently standing in for an available primary Blade
contained in another BE, enter the following command:
# /nas/bin/server_standby <server_name> -restore mover
where <server_name> is the name of the server. Use the nas_server –list command to get
the server name.
Example:
In a four Blade system with BE 1 faulted, Blade 2 has failed over to Blade 4, its standby. The
command to restore Blade 2 as the primary Blade is:
# /nas/bin/server_standby server_2 -restore mover
d. If the Blade is configured as a standby Blade and is not currently standing in for a primary Blade
contained in another BE, remove the Blade as a standby. Enter the following command:
Note: A Blade can be configured as a standby for multiple primary Blades. Repeat this step for each
primary Blade for which the Blade is a standby. Afterwards, go to step 4.
where <server_name> is the name of the primary Blade. Use the nas_server -list command to get
the Blade name.
4. [ ] Stop the NAS service by entering:
CAUTION: Stopping the NAS service disrupts service on the system. Any process initiated by
the Control Station or any commands run from the /nas directory is inhibited. Do not to
attempt to run this command while working in the /nas directory.
This command will take up to five minutes to complete. Make sure it is finished before you continue. If
this command fails, reboot the CS and try again.
5. [ ] Power off each Blade in the faulted BE:
CAUTION: When you shut down a Blade in a single BE system, you will lose access to the
public networks and file systems. Verify that the customer is prepared for this action.
Where <slot_number> is the slot ID number of the Blade inside the BE.
6. [ ] Wait three minutes for the previous commands to complete and then verify that the Blades have
powered off:
Note: The getreason command can be augmented to provide continuous monitoring of the system
reason codes by adding the watch command. Use Ctrl/C to stop the monitoring. For example, watch
/nasmcd/sbin/getreason.
Example:
6 - slot_0 primary control station
- slot_2 powered off
- slot_3 powered off
CAUTION: The enclosure is heavy and should be installed into a rack by two people (or a
data-center lift tool). To avoid personal injury and/or damage to the equipment, do not attempt to lift
and install the enclosure without a mechanical lift or help from another person.
1. [ ] Remove four M5 12.7 mm screws (two per side) to release the chassis from the cabinet (Figure 4).
Note: The screws also secure the latch brackets mounted on the front of the chassis.
CL4138a
2. [ ] From the front of the cabinet and, with help from another person, lift the chassis and slide it out of
the cabinet (Figure 5).
3. [ ] Place the enclosure on a work surface.
CL4137a
1. [ ] From the front of the cabinet, lift the chassis and slide it onto the rails (Figure 6).
When the chassis slides to the back of the cabinet, the two notches in the rear of the chassis insert into
the rear tabs on each rail. The tabs secure and support the rear of the chassis.
If the chassis does not slide all the way into the cabinet, you may need to loosen the screws that hold
the rear of the rails in place, then adjust the rails to allow the tabs to fit into the notches.
CL4137b
2. [ ] Secure the front of the chassis to the vertical channels of the cabinet using four M5 12.7 mm
screws (two per side) (Figure 7).
CL4138b
You hear an audible click when the tab is pressed and the latch is released.
4. [ ] Lower the latch to release the power/cooling module from the CPU module.
Note: The orange tabs on the power/cooling module are labeled with a 1.
5. [ ] Pull the power/cooling module out of the chassis (Figure 8) and place it on an antistatic surface.
CL3972
AC AC AC AC
9. [ ] Install the CPU modules from the Blade in the replacement chassis in the corresponding locations
from which they were removed.
10. [ ] Install power/cooling modules into the replacement chassis in the corresponding locations from
which they were removed.
CAUTION: Observe ESD procedures; handle the Blade components with care. You can remove a
management module while the system is powered up.
22. [ ] Use masking tape or similar material to label each cable to correctly reconnect them later.
23. [ ] Disconnect the Ethernet cables from the module.
Carefully press the connector latches to release them before you remove the cables.
CAUTION: Do not bend, pull, or stress other cables. Dress and secure the cables to one side so
that you can remove a management module without damaging the cables.
24. [ ] On the management module, pull the trigger mechanism on the module handle to release it from
the Blade enclosure Figure 13.
CNS-001759
25. [ ] Using the handle, gently pull the module from the Blade enclosure.
CNS-001759
33. [ ] Align the module with the empty slot (Figure 16).
Install the replacement I/O module in the same slot from which the faulted I/O module was removed.
34. [ ] When the I/O module appears seated, push and release the small button on the handle
(Figure 16).
If the button remains in, the module is fully seated .
If the button springs back, gently push the module further into the chassis, then push it again.
If the button still does not rest flush with its handle, remove the module and repeat steps 1 and 2.
35. [ ] If the I/O module you installed is a replacement for a faulted I/O module, attach the cables to the
replacement I/O module in the exact same locations that they were attached in the faulted I/O module.
38. [ ] Reconnect the Blade enclosure communication cables and all remaining of the cables to the rear
of the Blade enclosure by following the cable labels.
Note: Do not reconnect the Blade enclosure power cables at this time.
1. [ ] Connect the null modem cable from the COM 1 port of your service laptop to the RS-232 serial
port on the rear of the A-side management module (Figure 18).
Port 2
Port 0 Port 1
Management Management
module B module A CNS-001679MOD
CAUTION: Do not bend, pull, or stress any other cabinet cables. Dress and secure the cables
without damaging the cables.
3. [ ] Reconnect the BE AC power cords, carefully pressing the connector latches to restrain these
cables after you connect them.
5. [ ] Enter the password SHIP_it and press Enter. The Diagnostic Menu appears.
6. [ ] Select the Resume PROM Sub-Menu, and press Enter. The Resume PROM Sub-Menu
appears.
7. [ ] Select Set Resume, and press Enter. A list of the devices in the enclosure appears.
8. [ ] Select XP Chassis, and press Enter.
9. [ ] Press Enter for each of the following prompts to accept the default values:
Enter EMC TLA Part Number (16 characters)[100-562-956]:
Enter EMC TLA Artwork Revision (3 characters)[]:
Enter EMC TLA Assembly Revision (3 characters)[A01]:
Enter EMC TLA Serial Number (16 characters)[FCNDN082300025]:
Note: The prompts will include examples of what should be populated in the system, shown in brackets [].
11. [ ] Enter the EMC Part Number field from Table 2 of the Resume PROM information you gathered in
the Saving Resume PROM configuration information procedure, then press Enter.
12. [ ] The system prompts with:
Enter EMC System Hardware TLA SN (16 characters)[FNM00084100038]:
13. [ ] Enter the EMC_SERIAL_NUMBER from Table 2 of the Resume PROM information you gathered
in the Saving Resume PROM configuration information procedure, then press Enter.
14. [ ] The system prompts with:
Enter EMC System Hardware TLA Rev (3 characters)[A01]:
18. [ ] Enter the PRODUCT PN (16 characters) = EMC_PRODUCT_PART_NUMBER from Table 2 of the
Resume PROM information you gathered in the Saving Resume PROM configuration information
procedure, then press Enter.
19. [ ] The system prompts with:
Enter Product SN (16 characters)[FNM00000000000]:
22. [ ] Enter the value from the ASSEMBLY_NAME field from Table 2 of the Resume PROM information
you gathered in the Saving Resume PROM configuration information procedure, then press Enter.
23. [ ] The system prompts:
Enter Number of Programmables (0-84) [0]:
Enter World Wide Name (WWN) Seed (0x200000-0x7FEFFFFF) [0x3B200E19]:
Vendor ID: 0x7
Model ID: 0x0C
Seed: 0x00E19
Enter PCBA Part Number (16 characters)[005348742]:
Enter PCBA Assembly Revision (3 characters)[A10]:
Enter PCBA Serial Number (16 characters)[FCNBD082021620]:
Enter Channel Technology/Speed (0x00-0xFFFF) [0x00]:
Note: In the above example, the information enclosed in brackets shows you an example of what should be
populated.
24. [ ] Continue pressing Enter to accept the default values until the system prompts: System Types:
0 - Agnostic
1 - Clariion
2 - Celerra
3 - Clariion AUX
4 - Symmetrix
5 - Centera
6 - E3 Diag Mode
Enter System Type (0-6) [2]:
0 2 and 3
1 4 and 5
2 6 and 7
3 8 and 9
27. [ ] When the system prompts for the NAS Enclosure ID, enter the number of the replacement BE.
Table 4 lists the NAS Enclosure ID and the Blades associated with each enclosure.
28. [ ] Continue pressing Enter to accept the default values until the system prompts:
Continue (y or n) [yes]?
Serial port
CNS-000918
Parity: None
Stop bits: 1
Flow control: None
Emulation: Auto Detect
Telnet terminal ID: ANSI
3. [ ] Log in to Control Station CS 0 as nasadmin. Change to the root user by entering the following
command:
$ su root
b. If no nas_mcd services are running, skip to Collect System Information and Save SP Logs.
c. If any nas_mcd services are running, type:
# killall nas_mcd
d. If the killall command was used, recheck that all nas_mcd services are stopped:
# ps axwf|grep nas_mcd
hostname
Host IP address
SP A IP address
Storage Group Name
5. [ ] Collect the hostname of the system by typing the following command. Record the hostname in
Table 5.
# hostname
Sample output:
Celerra_naut3
6. [ ] Use the hostname parameter to collect host IP address. Record the Host IP address in Table 5.
# cat /etc/hosts | grep "<hostname>"
Example Output:
# cat /etc/hosts | grep "Celerra_naut3"
10.6.4.125 Celerra_naut3.rtp.lab.emc.com Celerra_naut3
7. [ ] Collect the IP address assigned to SP A, shown in bold in the following example output. Record
SP A’
s IP address in Table 5.
# cat /etc/hosts | grep "SP"
Example output:
10.6.4.126 A_FCNTR074200022 Celerra_naut3.rtp.lab.emc.com # CLARiiON SP
10.6.4.127 B_FCNTR074200022 Celerra_naut3.rtp.lab.emc.com # CLARiiON SP
Example Output:
# /nasmcd/sbin/navicli -h 128.221.252.200 port -list
Information about each HBA:
HBA UID: 50:06:01:60:BC:E0:08:9F:50:06:01:60:3C:E0:08:9F
Server Name: Celerra_naut3
Server IP Address: 10.6.4.125
HBA Model Description:
HBA Vendor Description:
HBA Device Driver Name:
Information about each port of this HBA:
SP Name: SP A
SP Port ID: 0
HBA Devicename:
Trusted: NO
Logged In: NO
Defined: YES
Initiator Type: 3
StorageGroup Name: Celerra_naut3
...
...
Information about each SPPORT:
SP Name: SP A
SP Port ID: 0
SP UID: 50:06:01:60:BE:A0:01:A8:50:06:01:60:3E:A0:01:A8
Link Status: Up
Port Status: Online
Switch Present: NO
.....
....
b. Display the information about each user-defined storage group in the system. Save this output in the
event of rollback. Record the Storage Group Name in Table 5. Output of this command contains the
HBA UID which will be used to remove initiator records, later in the procedure.
IMPORTANT: The information in this output will be used to remove the current initiator records, or in
the event of a roll-back scenario, to recreate the current configuration.
Use the command below that best represents the system type:
If you are replacing a Blade enclosure in a VNX VG2 or VNX VG8 gateway, use the following command
to complete this task:
# /nasmcd/sbin/navicli -h <SPA_IP_Address> storagegroup -list –host
IMPORTANT: Fabric connected gateway systems may attach to a storage array with multiple hosts
connected. This command will display all storage groups implemented on the storage array. Ensure
that you record only the storage group associated with the correct host name and only for the
connections stemming from the faulted Blade enclosure.
Example:
# /nasmcd/sbin/navicli -h 10.6.4.126 storagegroup -list –host
Storage Group Name: Celerra_naut3
Storage Group UID: 3E:86:BB:3D:70:64:DF:11:9C:6F:00:60:16:36:9C:7D
HBA/SP Pairs:
HLU/ALU Pairs:
Note: Fabric connected gateway systems will have eight paths per enclosure.
If you are replacing a Blade enclosure on any other VNX Unified or File system, use the following
command to complete this step:
Example:
# /nasmcd/sbin/navicli -h 10.6.4.126 storagegroup –list –gname "~filestorage"
Storage Group Name: ~filestorage
Storage Group UID: 3E:86:BB:3D:70:64:DF:11:9C:6F:00:60:16:36:9C:7D
HBA/SP Pairs:
HLU/ALU Pairs:
3. [ ] Check that the system shows the proper number of Blades by typing:
# /nasmcd/sbin/getreason –e
Sample output:
6 - slot_0 control station ready
0 - slot_2 reset(Post Code = 0x61 ; Mid Code = 0x80 ; Blade Code = 0x3e
0 - slot_3 reset(Post Code = 0x61 ; Mid Code = 0x80 ; Blade Code = 0x39
CAUTION: Entering the wrong HBA UID will cause the removal of valid initiator records. Ensure
that the HBA UID entered matches the record you wish to remove, which were found in Collect
system information and save SP logs.
IMPORTANT: The command requires the HBA UID to be input with the necessary colons, as shown in
the example below.
Example:
# /nasmcd/sbin/navicli -h 10.6.4.126 port -removeHBA -o -hbauid
50:06:01:60:BC:E0:08:9F:50:06:01:61:3C:E0:08:9F
HBA: 50:06:01:60:BC:E0:08:9F:50:06:01:61:3C:E0:08:9F(y/n)?
y
Warning: This is an active HBA. Do you still want to remove it (y/n)?
y
5. [ ] Repeat the previous step for each Blade port initiator record to be removed for the BE being
replaced.
6. [ ] After removing the necessary initiator records, display the storage group list for the existing
configuration to verify that all necessary records have been successfully removed. Compare this output
to the output collected in Collect system information and save SP logs to verify that the necessary
initiator records have been removed.
Note: If the system is configured with more than two Blades, initiator records for the Blades in non-faulted
Blade enclosure(s) should still appear.
Use the command below that best represents the system type:
If you are replacing a Blade enclosure in a VG2 or VG8 gateway, use the following command to
complete this step:
# /nasmcd/sbin/navicli -h <SPA_IP_Address> storagegroup –list –host
Example Output:
If you are replacing a Blade enclosure on any other VNX Unified or File system, use the following
command to complete this step:
# /nasmcd/sbin/navicli -h <SPA_IP_Address> storagegroup -list -gname "~filestorage"
Example:
# /nasmcd/sbin/navicli -h 10.6.4.126 storagegroup –list –gname "~filestorage"
Sample Output:
Storage Group Name: ~filestorage
Storage Group UID: 3E:86:BB:3D:70:64:DF:11:9C:6F:00:60:16:36:9C:7D
HLU/ALU Pairs:
HLU Number ALU Number
---------- ----------
5 35
4 34
1 31
2 32
0 30
3 33
Shareable: YES
Note: If this command fails to PXE boot the Blade, repeat until the PXE boot completes.
# /nasmcd/sbin/t2tty -p <slot_number>
where <slot_number> is the slot number of the Blade(s) in the replacement BE.
10. [ ] Check the fault LED on the front of the BE Figure 20. If the LED is amber, there is still a faulted
component in the BE or the BE is still faulted.
AC AC AC AC
CNS-001668
Figure 20 BE, power/cooling module, and CPU module fault LEDs (bezel removed)
11. [ ] Query the system to display the new WWNs assigned to each Blade in the enclosure by typing the
following command.
IMPORTANT: Ensure that you capture the output returned for the following commands. The new
initiator records cannot be created without this information.
12. [ ] Construct the new Blade port WWNs (HBA UIDs) and record these values in Table 6.
Construct each WWN by grouping the unformatted Node and Port values of each Blade port into a
single string delimited with colons between every two digits.
For example, using the values shown for Blade 2 in the output shown in the previous step, the WWNs
for Blade 2 are:
Blade 2 - port 0: 50:06:01:60:c4:60:20:75:50:06:01:60:44:60:20:75
Blade 2 - port 1: 50:06:01:60:c4:60:20:75:50:06:01:61:44:60:20:75
Blade number Blade port Blade port WWN (HBA UID) SP and port SP ports marked with an * only
apply to a fabric-connected gateway. Do not
use for a direct-connected gateway.
1
SP ports marked with an * only apply to fabric-connected gateway. Do not sue for a direct-connected
gateway.
where <slot_number> is the slot number of each Blade in the replacement BE.
Two examples are presented below. Example 1 shows the commands and output for Blade enclosure 0
in a Unified, File, or direct-connect gateway. Whereas, Example 2 is a model for a gateway in a
fabric-connect configuration. Follow the example that reflects the system’
s configuration.
Example 1:
Example 1 shows the commands and output for Blade enclosure 0 in a Unified, File, or direct-connect
gateway. With:
Blade 2 - port 0 (HBA 0) connected to SP A - port 0
Blade 2 - port 1 (HBA 1) connected to SP B - port 1
Blade 3 - port 0 (HBA 0) connected to SP A - port 0
# /nasmcd/sbin/t2tty -c 2 "fcp show" | egrep "HBA 0|HBA 1"FCP ONLINE HBA 0: ALPA 000001
WWN: 5006016044602075 QE8 4G
FCP scsi-0: HBA 0: ALPA 0000ef SP-a00: 50060160BCE02EA7 Class 3
FCP ONLINE HBA 1: ALPA 000001 WWN: 5006016144602075 QE8 4G
FCP scsi-16: HBA 1: ALPA 0000ef SP-b00: 50060160BCE02EA7 Class 3
# /nasmcd/sbin/t2tty -c 3 "fcp show" | egrep "HBA 0|HBA 1"FCP ONLINE HBA 0: ALPA 000001
WWN: 5006016844602075 QE8 4G
FCP scsi-0: HBA 0: ALPA 0000ef SP-a01: 50060160BCE02EA7 Class 3
FCP ONLINE HBA 1: ALPA 000001 WWN: 5006016944602075 QE8 4G
FCP scsi-16: HBA 1: ALPA 0000ef SP-b01: 50060160BCE02EA7 Class 32:
Example 2 shows the commands and output for Blade enclosure 0 in a gateway in a fabric-connect
configuration. With:
Blade 2 - port 0 (HBA 0) connected to SP A - port 0 and SP B - port 0
Blade 2 - port 1 (HBA 1) connected to SP A - port 1 and SP B - port 1
Blade 3 - port 0 (HBA 0) connected to SP A - port 0 and SP B - port 0
Blade 3 - port 1 (HBA 1) connected to SP A - port 1 and SP B - port 1
# /nasmcd/sbin/t2tty -c 2 "fcp show" | egrep "HBA 0|HBA 1"FCP ONLINE HBA 0: S_ID 014500
WWN: 5006016044602075 QE8 8G
FCP scsi-0: HBA 0: D_ID 043f00 SP-a00: 50060160BCE02EA7 Class 3
FCP scsi-16: HBA 0: D_ID 043b00 SP-b00: 50060160BCE02EA7 Class 3
FCP ONLINE HBA 1: S_ID 014700 WWN: 5006016144600544 QE8 8G
FCP scsi-32: HBA 1: D_ID 043900 SP-b01: 50060160BCE02EA7 Class 3
FCP scsi-48: HBA 1: D_ID 043d00 SP-a01: 50060160BCE02EA7 Class 3
# /nasmcd/sbin/t2tty -c 3 "fcp show" | egrep "HBA 0|HBA 1"FCP ONLINE HBA 0: S_ID 014300
WWN: 5006016844600544 QE8 8G
14. [ ] Enter the following command to assign a storage group to a specific HBA by creating an initiator
record. You must create an initiator record for each connection (physical or logical) from the Blade ports
(0 and 1) to the SP ports.
CAUTION: Before issuing the command, verify that the values used to create the new initiator
record contain the correct information for each connection. Incorrect information can cause a
misconfiguration of the initiator record.
Use the command below that best represents the system type:
Use the following command for a VNX VG2 or VNX VG8 gateway:
# /nasmcd/sbin/navicli -h <SPA_IP_Address> storagegroup -setpath -o –gname
<StorageGroup_Name> -hbauid <Blade_Port_HBA_UID> -sp <SP_Letter> -spport
<SP_Port_Number> -ip <Host_IP_Address> -host <Hostname>
Use the following command for all other VNX Unified or File systems:
Where:
<SPA_IP_Address> is the SP A IP address in Table 5.
<StorageGroup_Name> is the Storage Group Name in Table 5
<Blade_Port_HBA_UID> is the Blade port WWN in Table 6
<SP_Letter> is the letter of the SP to which the Blade port it is to be connected. Refer to Table 6
<SP_Port_Number> is the SP port number to which the Blade port it is to be connected. Refer to
Table 6
<Host_IP_Address> is the system Host IP address in Table 5.
<Hostname> is the hostname in Table 5.
Example for a non-gateway VNX platform:
# /nasmcd/sbin/navicli -h 10.6.4.126 storagegroup -setpath -o –gname ~filestorage -hbauid
50:06:01:60:c4:60:20:75:50:06:01:60:44:60:20:75 -sp a -spport 0 -type 32 –ip 128.221.252.2 -host
Celerra_naut3
Warning: Changing configuration options may cause the array to stop functioning correctly.
Failover-related Initiator settings for a single host MUST BE CONSISTENTfor all paths from the host
to the storage system. Please verify after reconnect.
Do you wish to continue (y/n)? y
15. [ ] Repeat Step 2 for each necessary initiator record. When all initiator records have been created go
to Step 4.
Note: For a fabric-connected gateway, Step 2 must be issued a total of eight times, since each Blade port is
fabric connected to a the same port on each storage processor.
16. [ ] Confirm the new initiator records are correct by comparing the new initiator records with the original
initiator records.
17. [ ] Use the command below that best represents the system type:
For a VG2 or VG8 gateway, use the information collected in Collect system information and save SP
logs for the original configuration.
# /nasmcd/sbin/navicli -h <SPA_IP_Address> storagegroup -list - hosts
Sample output:
Storage Group Name: Celerra_naut3
Storage Group UID: 3E:86:BB:3D:70:64:DF:11:9C:6F:00:60:16:36:9C:7D
HBA/SP Pairs:
HLU/ALU Pairs:
18. [ ] For any other VNX Unified or File systems, use the information collected in Collect system
information and save SP logs for the original configuration.
# /nasmcd/sbin/navicli -h <SPA_IP_Address> storagegroup -list -gname "~filestorage"
Sample output:
Storage Group Name: ~filestorage
Storage Group UID: 3E:86:BB:3D:70:64:DF:11:9C:6F:00:60:16:36:9C:7D
HBA/SP Pairs:
HLU/ALU Pairs:
Note: If this command fails to PXE boot the Blade, repeat until the PXE boot completes.
# /nasmcd/sbin/t2tty -p <slot_number>
where <slot_number> is the slot number of the Blade(s) in the replacement BE.
21. [ ] Check that the Blades have successfully PXE booted. The Blades should appear with reason code
4.
# /nasmcd/sbin/getreasonoutput:
6 - slot_0 control station ready
4 - slot_2 configured
4 - slot_3 configured
22. [ ] Verify that each PXE Booted Blade shows No root file system in the output of the following
command:
# /nasmcd/sbin/t2tty -C <slot_number> "ls"
Expected output:
CAM Devices on scsi-0:
TID 00: 0:d0+ 1:d1+ 2:d2+ 3:d3+ 4:d4+ 5:d5+
TID 01: 0:d6- 1:d7+
CAM Devices on scsi-16:
TID 00: 0:d8- 1:d9- 2:d10- 3:d11- 4:d12- 5:d13-
TID 01: 0:d14+ 1:d15-
1278457867: ADMIN: 6: Command succeeded: camshowconfig
26. [ ] Check that the nd-clnt threads on the Control Station have started:
# ps axwf | grep nd-clnt
Sample output:
30751 pts/0 S+ 0:00 \_ grep nd-clnt
3500 ? S 0:03 [nd-clnt 0 1]
3501 ? S 0:07 [nd-clnt 4 5]
4217 ? S 0:02 [nd-clnt 5 6]
Note: This command may take up to 20 minutes to complete. Do not continue with the next step until this
command has completed.
# /nasmcd/sbin/setup_slot -i <slot_number>
30. [ ] Verify if the Blades are configured correctly by entering the following command for each Blade in
the enclosure:
# ping –c 3 <server_name>
where <server_name> is the name of the Blades in the BE. Use the nas_server –list command
to get the Blade names if necessary.
31. [ ] Check that the Blades are properly configured:
# /nas/bin/nas_server –list
Note: If the output for one of the Blades shows <unmounted>, that Blade is configured as a standby.
2. [ ] Wait 10 minutes to allow for the NAS service to completely load, then at the login prompt, log in as
nasadmin.
3. [ ] Change to root by entering:
$ su root
4. [ ] If you halted the secondary CS in Halting the secondary control station, reconnect the CS 1 power
cable to the PDP. If not, proceed to the next step.
5. [ ] Verify that the system is in a sane state:
Sample output:
3579 ? Sl 0:00 /nasmcd/nas_mcd -h /nasmcd /nas/sys/nas_mcd.cfg
10819 ttyS1 S+ 0:00 \_ grep nas_mcd
Sample output:
8855 ? Ssl 0:00 \_ /nas/sbin/nas_boxmonitor /nas -i 8
10839 ttyS1 S+ 0:00 \_ grep box
c. Type the following to check that the necessary file systems are mounted:
# df –h
7. [ ] For a multi-BE system, if any of the Blades in the replacement enclosure were failed over to a
standby Blade housed in another BE in Prepare DME Replacement, complete the following to fully
failback the Blade.
a. If this is single-BE system or no Blades are in a failed over state go to the next step:
b. Restore the Blades to the configured primary/standby Blade relationship by typing the following
command:
Note: This command requires several minutes to complete. Wait until this command has completed before
continuing.
where <server_name> is the name of the standby Blade functioning as the primary Blade.
Example:
Suppose Blade 3 is a standby Blade for Blade 2 and Blade 2 has failed over to Blade 3. The command
to restore Blade 2 as the primary Blade is:
# /nas/bin/server_standby server_2 -restore mover
Sample Output:
server_2 :
server_2 : going standby
server_2.faulted.server_3 : going active
replace in progress ...done
failover activity complete
commit in progress (not interruptible)...done
server_2 : renamed as server_3
server_2.faulted.server_3 : renamed as server_2
8. [ ] If you removed a standby relationship in Prepare DME Replacement, reestablish this relationship
by entering the following command for each standby relationship previously removed.
# /nas/bin/server_standby <primary_server_name> -create mover=
<standby_server_name>-policy <policy_type>
where:
<primary_server_name> is the name of the designated primary Blade
<standby_server_name> is the name of the Blade designated standby Blade
<policy_type> is the failover policy; EMC recommends an auto failover policy
9. [ ] If the Blades are configured as primary and standby, perform a failover test on the Blades in the
replacement enclosure.
If the Blades are not in a failover configuration, proceed to the next step.
a. Determine the Blade failover status using the command below and Table 3:
# /nas/bin/nas_server -info –all
b. To test the failover configuration of the Blades in the replacement enclosure, type the following
command:
# /nas/bin/server_standby <server_name> -activate mover
where <server_name> is the name of the primary Blade. Use the nas_server -list command to
get the server name.
Example:
In a two Blade system, Blade 2 has failed over to Blade 3, its standby. The command to restore Blade 2
as the primary Blade is:
# /nas/bin/server_standby server_2 -restore mover
10. [ ] Verify the Blades have the correct mounts by entering the following commands for each Blade:
# /nas/bin/server_mount <server_name>
where <server_name> is the name of the lowest-numbered Blade in the replacement BE.
Use the nas_server -list command to get the server name.
11. [ ] Check the fault LED on the front of the BE (Figure 20). If the fault LED is on, go to FRU failures for
information on how to resolve the problem.
1. [ ] Carefully align the front bezel, by holding the bezel edges, to the two brackets on the front of the
enclosure.
CL4032
2. [ ] Push the bezel in until you feel it snap into place. Pull back on the bezel lightly to ensure it is
secure.
3. [ ] Lock the bezel by turning the key clockwise.
Example:
# /nas/bin/nas_checkup
Check Version: <NAS_version>
Check Command: /nas/bin/nas_checkup
Check Log : /nas/log/checkup-run.100128-181007.log
-------------------------------------Checks-------------------------------------
Control Station: Checking if file system usage is under limit.............. Pass
Control Station: Checking if NAS Storage API is installed correctly........ Pass
Control Station: Checking if NBS clients are started....................... Pass
Control Station: Checking if NBS configuration exists...................... Pass
Control Station: Checking if NBS devices are accessible.................... Pass
Control Station: Checking if NBS service is started........................ Pass
Storage System : Checking for unsupported storage profile.................. Pass
Storage System : Checking if Fibre Channel zone checker is set up.......... N/A
Storage System : Checking if Fibre Channel zoning is OK.................... N/A
Storage System : Checking if proxy arp is setup............................ Pass
Storage System : Checking SPA SPB communication............................ Pass
--------------------------------------------------------------------------------
If the output of the nas_checkup command indicates any problems, correct the problems and re-do the
command before continuing.
1. [ ] Wait one minute to see if the fault LED turns off. The component might take a few seconds to turn
off the LED.
2. [ ] If the fault LED remains on, check that the component is properly inserted.
3. [ ] Check the /nas/log/sys_log file to identify if the FRU error is cleared and the LED has not
turned off yet. The sys_log file contains any error messages that relate to the FRU. To view the
contents of the sys_log file, type:
# /nas/bin/nas_logviewer /nas/log/sys_log|more
Example:
#/nas/sbin/enclosure_status -v -e 0 |grep -i failed
14. [ ] From the ConnectHome configuration, determine the connections that are enabled:
/nas/sbin/nas_connecthome –i
For example:
/nas/sbin/nas_connecthome -t -email_1
or
/nas/sbin/nas_connecthome -t -email_2
or
/nas/sbin/nas_connecthome -t –https
or
/nas/sbin/nas_connecthome -t -modem_1
17. [ ] If the Recipient Address(es) field is empty, email notifications has not been configured and does
not need to be enabled. If you want to configure email notifications, use the
/nas/bin/nas_emailuser command or Unisphere to configure it.
18. [ ] If the Recipient Address(es) field is populated, email notifications was enabled. Re-enable email
notifications:
c. Test the configuration and verify that the configured Recipient Address(es) received the test email:
/nas/bin/nas_emailuser –test
The array will now call home or responded to any other template applied to the VNX OE File array. Skip
over the next Block task to conclude the service activity.
Finish
19. [ ] Upon completion of this service activity, update the CSI database with the new VNX system
information for this site. Standard Global Services discipline requests you to register this service activity
in CSI, particularly for an Install or an Upgrade. You can use CRM or MS Outlook IB Database email
form in accordance with your team's reporting policy. Verify that all the customer site information is
correct (Table 7).
Parts Return
20. [ ] Locate the Parts Return Label package and fill out the shipping label and apply to box for return to
EMC.
21. [ ] Apply other labels for the box appropriate to this returning part. Check for any Engineering bulletin
and with the Customer to determine if it is FA bound.
IMPORTANT: If Engineering has requested the part to be shipped back for an FA, ensure the proper ship
to information has been entered and Attention to and/or Contact Name is provided in the shipping label.
22. [ ] Securely tape the box and Ship the failed part back to EMC.