Hi,
I have a problem with my 2-node exchange 2010 SP3 RU 2 DAG
i think it is set up correctly and the script written by Paul (exchangeserverpro.com - GetDAGHealth - http://exchangeserverpro.com/get-daghealth-ps1-database-availability-group-health-check-script/ ) and other scripts and commands show it all OK (just quorum
group is failed which I studied a lot about it and seems it is not a problem) all others show healthy
when I myself restart a mailbox server (MB1 for example) all mailbox copies are activated on the other one and no problem is reported
but here is my sad story
these are all virtual machines in ESXi 5.0 infrastructure
the other day, one of storage servers went away :( and so the mailbox1 machine was completely out of play. assume that you suddenly shutdown a mailbox server
I expected the DAG to help me but it did not
just he databases which were active on the MB2 were up but all the others down. unmounted or in unknown state
i tried a lot to make the mailbox2 host the databases which were active on the failed machine
but it said failed failed and failed ..
then i went to shell and tested everything
once it said you need to use -skipactivecopychecks once said -skiplags once said need to update index and ..
but none of them worked and this was all with some similar error that said : the service ... is not started or cannot be found in mailbox1 server... Oh come on man ! i know this ! the mailbox is totally vanished and gone and i expect you to take over and
make all databases activated on yourself but .. no chance :(
so i want to know is anything wrong here ?
is the quorum group failed important ?
how on earth i should recover from such a situation in which the other mailbox is totally destroyed and no ping is available, no service on it and nothing, nothing ...
and if it can be done automatically, so much better and if cannot be, how should i do it manually when i tried all the ways, switches, EMC, EMS and all said the service .. cannot be found, accessed , .. on the other mailbox server (the failed one which hosted
the active copy of some data bases) ..
Thanks so much
and by the way this is the error (as u see, it is always looking for the service and copy status on the destroyed mailbox and as i said, skipactivecopychecks and .. goes to skiplags, update and index and always again error about not able to contact mailbox1)
Summary: 1 item(s). 0 succeeded, 1 failed.
Elapsed time: 00:00:10
Move-ActiveMailboxDatabase
Failed
Error:
Error getting mailbox database copy status from server "mailbox1". You can use -SkipActiveCopyChecks to skip this validation check. Error A server-side administrative operation has failed. The Microsoft Exchange Replication service may not
be running on server MAILBOX1.gbgnetwork.net. Specific RPC error message: Error 0x71a (The remote procedure call was cancelled) from cli_GetCopyStatusEx2.
Exchange Management Shell command attempted:
Move-ActiveMailboxDatabase -Identity 'UMFUsersDatabase' -ActivateOnServer 'MAILBOX2' -MountDialOverride 'None'
Elapsed Time: 00:00:10