On-site personnel reported that a database and its cluster resources had failed to start, and they sent screenshots and logs for analysis.
After analyzing the logs, I found that the ASM instance had not started, so the OCR and the resources that depend on it could not start.
Question 1:
2018-05-08 11:59:39.980: [ OCRASM][1639659296]proprasmo: Error in open/create file in dg [OCRDG]
2018-05-08 11:59:39.980: [ OCRASM][1639659296]ASM Error Stack : ORA-15077: could not locate ASM instance serving a required diskgroup
2018-05-08 11:59:39.981: [ OCRASM][1639659296]proprasmo: kgfoCheckMount returned [7]
The second error message points to a probable problem with the heartbeat (private interconnect) network.
Question 2:
2018-05-08 13:06:36.097: [ CSSD][3379095296]clssnmvDHBValidateNcopy: node 1, rac1, has a disk HB, but no network HB, DHB has rcfg 421249059, wrtcnt, 94952864, LATS 861554, lastSeqNo 94952861, uniqueness 1525754229, timestamp 1525755985/867374
However, a ping test across the private network showed no problem.
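The two error signatures quoted above (ORA-15077 in the OCR/ASM trace, and "has a disk HB, but no network HB" in the CSSD trace) can be picked out of a log mechanically. The following is only a minimal sketch: the function names and the labels they print are made up for illustration and are not part of any Oracle tooling.

```shell
#!/bin/sh
# Classify one log line by the two error signatures seen in this incident.
# The labels printed here are illustrative names, not Oracle terminology.
classify_log_line() {
    case "$1" in
        *"ORA-15077"*)
            # OCR could not be opened because no ASM instance serves the disk group.
            echo "asm-not-serving-diskgroup" ;;
        *"has a disk HB, but no network HB"*)
            # Extract the node name that follows "node <n>, " in the CSSD message.
            node=$(printf '%s\n' "$1" |
                sed -n 's/.*node [0-9][0-9]*, \([^,]*\), has a disk HB.*/\1/p')
            echo "no-network-heartbeat:${node}" ;;
        *)
            echo "other" ;;
    esac
}

# Read log lines on stdin and print a label for each line that matches.
scan_log() {
    while IFS= read -r line || [ -n "$line" ]; do
        result=$(classify_log_line "$line")
        [ "$result" = "other" ] || echo "$result"
    done
}
```

For example, `scan_log < /path/to/ocssd.log` (the path is a placeholder) would print one label per matching line, which makes it easy to see which node is reported as missing its network heartbeat.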
The cause of Question 1 is clear: in an Oracle RAC deployment that uses ASM, the OCR is stored in ASM, so the ASM instance must be running before the cluster resources are started.
The ASM instance was not brought up after the server restarted, so we have to start it manually.
sqlplus / as sysasm
startup
After starting the ASM instance, we start the cluster-related resources:
[root@rac1 ~]# crsctl start has
CRS-4638: Oracle High Availability Services is online
Start the instance resources, etc.:
crsctl start cluster -all
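The start order described above, together with the user each step runs as, can be written down as a small script. This is only a sketch under stated assumptions: it assumes a role-separated install where an OS user named grid owns the Grid Infrastructure home and has its environment set for the local ASM instance, and that the script itself runs as root. The `run_as` and `start_rac_node` helpers are hypothetical names. By default it only prints the steps (`DRY_RUN=1`).

```shell
#!/bin/sh
# Sketch of the manual start order used in this incident.
# DRY_RUN=1 (the default) only prints each step; set DRY_RUN=0 to execute.
# Assumptions: the "grid" OS user can connect to ASM with "/ as sysasm",
# and this script is invoked by root.
DRY_RUN="${DRY_RUN:-1}"

# run_as USER COMMAND: print the step, and execute it unless DRY_RUN=1.
run_as() {
    user=$1
    shift
    printf '[%s] %s\n' "$user" "$*"
    if [ "$DRY_RUN" != "1" ]; then
        if [ "$user" = "root" ]; then
            sh -c "$*"
        else
            su - "$user" -c "$*"
        fi
    fi
}

start_rac_node() {
    # 1. Start the ASM instance first, because the OCR is stored in ASM.
    run_as grid "echo startup | sqlplus -S / as sysasm"
    # 2. Start Oracle High Availability Services (a root command).
    run_as root "crsctl start has"
    # 3. Start the cluster stack on all nodes (a root command).
    run_as root "crsctl start cluster -all"
}
```

Calling `start_rac_node` in dry-run mode prints the three steps prefixed with the user expected to run them, which is a quick way to review the sequence before executing anything.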
Verify that logging in to node 1 now works normally.
Then process the second node.
Summary: The on-site personnel were not familiar with the RAC startup sequence, and they mixed up the commands that belong to the oracle, grid, and root users.