by Berrys66 » Mon Jul 19, 2004 9:51 am
I get something very similar. My AC2K cluster worked fine until I installed the security patch from Microsoft issued in mid April. After installing the patch my cluster member's event logs would be completely filled with DCOM errors within a day. After a while the 2nd node in the cluster would stop responding and the only solution would be a reboot.
so, I uninstalled the patch, and low and behold, the event logs are now no longer completely full after a day or so. Now what is happening is that the cluster will run quite happily for 2 days, and then for some reason the 2nd node loses the plot and becomes unreachable. At that point I obviously start getting the 10009 DCOM errors. Rebooting the 2nd node sometimes will bring it back into synch, but more often than not I need to restart both machines in the cluster.
Anybody else seen this behaviour? And even better - does anybody know how to fix it, short of re-installing the operating system and AC2K.
I am using AC2K SP2 on W2K SP4.