Hi,
My Replicat process hung and was not running.
OGG-15163 There was a problem sending a message
When I tried to stop it or get stats for it, I got the following errors.
GGSCI (msdidb01.deveci.local) 22> stop R01MSD
Sending STOP request to REPLICAT R01MSD ...
2020-03-29 23:39:24 ERROR OGG-15163 There was a problem sending a message to REPLICAT (Timeout waiting for message).

GGSCI (msdidb01.deveci.local) 39> stats R01MSD
Sending STATS request to REPLICAT R01MSD ...
2020-03-30 09:43:43 ERROR OGG-15163 There was a problem sending a message to REPLICAT R01MSD (Timeout waiting for message).

GGSCI (msdidb01.deveci.local) 4> kill R01MSD
Sending KILL request to MANAGER ...
Killed process (351231) for REPLICAT R01MSD
This problem was related to a table in the Replicat parameter file.
An index on that table was unusable.
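The post does not show how the unusable index was identified. One common way to check, sketched here under the assumption that the replicated tables live in the MSD schema (taken from the rebuild statement in this post) and that you can query the DBA views, is to look for indexes and index partitions whose status is UNUSABLE:

```sql
-- Non-partitioned indexes marked UNUSABLE in the MSD schema
-- (the schema name is an assumption based on the index rebuilt in this post)
SELECT owner, index_name, table_name, status
FROM   dba_indexes
WHERE  owner = 'MSD'
AND    status = 'UNUSABLE';

-- Partitioned indexes show N/A in DBA_INDEXES, so check their partitions separately
SELECT index_owner, index_name, partition_name, status
FROM   dba_ind_partitions
WHERE  index_owner = 'MSD'
AND    status = 'UNUSABLE';
```

Any index returned by either query that belongs to a table mapped in the Replicat is a candidate for the rebuild.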
REPLICAT (Timeout waiting for message)
Because the index was unusable, you need to perform index maintenance to solve this problem.
Rebuild the index as follows.
alter session set ddl_lock_timeout=500;
alter index MSD.TEST_TABLE_RECEIVE_QUE_NDX rebuild parallel 32;
alter index MSD.TEST_TABLE_RECEIVE_QUE_NDX noparallel;
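Before restarting the Replicat, it may be worth confirming that the rebuild took effect. The query below is a minimal sketch, not part of the original post, that reads the index status and parallel degree back from DBA_INDEXES:

```sql
-- After the rebuild, STATUS is expected to be VALID,
-- and DEGREE should be back to 1 following the NOPARALLEL statement
SELECT index_name, status, degree
FROM   dba_indexes
WHERE  owner = 'MSD'
AND    index_name = 'TEST_TABLE_RECEIVE_QUE_NDX';
```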
Then start the Replicat again.
GGSCI (msdidb01.deveci.local as GGS_ADMIN@MSDB1) 143> start R01MSD
Sending START request to MANAGER ...
REPLICAT R01MSD starting

GGSCI (msdidb01.deveci.local as GGS_ADMIN@MSDB1) 144> info R01MSD
REPLICAT   R01MSD    Last Started 2020-03-30 19:04   Status RUNNING
Checkpoint Lag       43:57:50 (updated 00:00:00 ago)
Process ID           62363
Log Read Checkpoint  File /goldengate/ggate18c/dirdat/i8000000056
                     2020-03-28 23:06:48.004848  RBA 360101922

GGSCI (msdidb01.deveci.local as GGS_ADMIN@MSDB1) 151> info R01MSD
REPLICAT   R01MSD    Last Started 2020-03-30 19:04   Status RUNNING
Checkpoint Lag       00:00:00 (updated 00:00:00 ago)
Process ID           62363
Log Read Checkpoint  File /goldengate/ggate18c/dirdat/i8000000056
                     2020-03-30 19:25:34.009110  RBA 471167456
The problem has been solved.
This doesn't give any meaningful troubleshooting steps. I have the same type of issue, which is why I was searching and found this. My Replicat has 30-plus tables and is not throwing any errors in the log files. When the checkpoint lag started growing, we saw a huge delete going on on the extract side. Now only the Replicat is lagging, by 12 hours, but how to find the problematic table, or where the problem is, is not part of this article. Stats and stop were not working for you, and then you mention the index was unusable. How did you find that out?