Reason for outage : The controller card and cross-connect card were hang by unknown reason at P.O.Bangsue#2 exchange (middle exchange). And the E1 trunk card was failed by unknown reason at P.O.Bangsue#1 exchange (middle exchange.)
Recovery Action : Replaced new controller card , new cross-connect card and new E1 trunk card.
Timeline :
13/01/2016
04:15TH : Jastel team informed CID546961 was unstable symptom.
04:57TH : NMC UIH found alarm on trunk P.O.Bangsue#2 - Cement_Bangsue exchange and it was clear by itself on 04:57TH.
05:48TH : We asked Jastel team for permission to reroute customer to other path. We had waiting permission to do it.
10:06TH : Jastel team allowed us to reroute to other path on 11:00TH
10:12TH : After NMC team had investigated,they found this circuit could not reroute at that moment because the new path hadn't enough time slot of TDM equipment (2 Mbps used 32 time slots per node.The nodes on other path already have other customer inside all node) to move it.
10:50TH : Our FE went to P.O.Bangsue#2 exchange and found the controller card did not work as properly then waited for other team to arrived with new controller card.
11:30TH : Jastel team informed us the link was stable and kept monitoring it.
11:43TH : Jastel team informed us the link was down again and let us do any action to resolve it.
12:20TH : The support team of our FE arrived to P.O.Bangsue#2 exchange with new controller card.
12:25TH : Our FE replaced new controller card and we asked Jastel team for the current status which customer detected.
12:30TH : Jastel team informed us the link was stable and kept monitoring it.
14:02TH : NMC UIH found cross-connect card was hang with unknown reason at P.O.Bangsue#2 then the support team of our FE went to take the new cross-connect card.
14:10TH : Jastel team informed CID546961 was unstable symptom.
14:45TH : The support team arrived to node with new cross-connect card.
14:50TH : Our FE replaced new cross-connect card at P.O.Bangsue#2 exchange.
15:55TH : P.O.Bangsue#2 exchange had problem again then NMC UIH decided to create new route for this circuit.
16:40TH : NMC UIH created new route for this circuit as temporary fix and kept monitoring it (But this route was long distance then it will make the unstable symptom)
16:45TH : NMC UIH prepared the available trunk on short distance route to move this circuit from long distance route to short distance route as permanent fix.
17:10TH : Jastel team informed us the link was unstable symptom.
17:55TH : Jastel team informed us the link was stable and kept monitoring it (Alarm clear by itself).
18:36TH : We found alarm on trunk of long distance route then we tried to clear the available time slots to reroute this circuit to originate path
19:45TH : After NMC UIH had prepared to find the available time slots,NMC UIH did reroute to the originate path.
20:00TH : NMC UIH found the alarm on trunk of originate path then tried to find the optimize path for reroute path.
20:25TH : NMC UIH reroute this circuit to optimize path (the shortest one)
14/01/2016
09:57TH : Jastel team informed us the link was unstable symptom.
10:08TH : NMC UIH rerouted to other path.
10:20TH : NMC UIH found E1 trunk card was failed by unknown reason and dispatched FE to go onsite.
11:19TH : Our FE arrived with new E1 trunk card but still waiting the support team bring module equipment which used with new E1 trunk card.
12:18TH : Our FE replaced new E1 trunk card and NMC UIH asked Jastel for permission to reroute this circuit to originate path.
12:20TH : Jastel team informed us the link was stable and kept monitoring it.
16:32TH : NMC UIH found the error alarm on trunk (it will make this circuit be critical down) of this path then NMC UIH prepared to reroute this circuit to originate path.
16:35TH : NMC UIH rerouted to originate path and keep monitoring this circuit closely.