Windows Active DirectoryWindows Active Directory

I was doing some maintenance work on one of the many different environments I help maintain today when I noticed that Active Directory replication failed to one of the Domain Controllers in the domain.

When performing a repadmin /showrepl I saw the following:

C:\WINDOWS\system32>repadmin /replsummary
Replication Summary Start Time: 2023-06-28 09:41:07

Beginning data collection for replication summary, this may take awhile:
  .....


Source DSA          largest delta    fails/total %%   error
 ServerDC01         >60 days            5 /   5  100  (2148074274) The target principal name is incorrect.
 ServerDC02                  02m:15s    0 /   5    0


Destination DSA     largest delta    fails/total %%   error
 ServerDC01                  02m:15s    0 /   5    0
 ServerDC02         >60 days            5 /   5  100  (2148074274) The target principal name is incorrect.

I then proceeded to RDP into the server ServerDC02 to examine the Windows Event Log. From experience, I suspected finding traces of what was going on in the log Applications and Service Logs -> Directory Service.

Here I could see several errors pertaining to the KDC (Knowledge Consistency Checker), both warnings and errors. Event IDs logged were 1308, 1311, 1566 and 1865. Taking a closer look, the useful information needed was found in Event ID 1311:

The Knowledge Consistency Checker (KCC) has detected problems with the following directory partition. 
 
Directory partition:
CN=Configuration,DC=contoso,DC=com
 
There is insufficient site connectivity information for the KCC to create a spanning tree replication topology. Or, one or more directory servers with this directory partition are unable to replicate the directory partition information. This is probably due to inaccessible directory servers. 

Having worked with Windows Servers and Active Directory for quite some time, I quickly suspected that replication had failed for quite some time, causing the inconsistency. I had also seen similar issues quite a few years ago, so I decided to force a new replication from the Domain Controller having the FSMO role (Flexible Single Master Operation) in the domain to the Domain Controller that was having issues.

This requires you to stop the KDC service on the failing Domain Controller and reset its Machine Account Password.

To do this, I had to perform the following actions:

  • Stop and disable the KDC service on the Domain Controller having issues, ServerDC02
    • Stop-Service KDC
    • Set-Service KDC -StartupType Disabled
    • Restart-Computer
  • Once the server came back online, RDP in and open CMD with Administrative Privileges
  • Issue the command netdom resetpwd /s:ServerDC01.contoso.com /ud:contoso\adDomainAdminAccount /pd:*
    • Use the server address to the working domain controller for /s:
    • Use a valid Domain Admin account for /ud:
    • Using * for /pd: will ensure CMD prompts for the account password
  • The command should return The machine account password for the local machine has been successfully reset.
  • Go to the working Domain Controller and open CMD with administrative privileges
  • Issue the command repadmin /replicate <destination_dc> <source_dc> "DC=contoso,DC=com"
    • Replace <destination_dc> with the full address for the faulty Domain Controller, in this case ServerDC02.contoso.com
    • Replace <source_dc> with the full address for the working Domain Controller, in this case ServerDC01.contoso.com
  • The command should return Sync from ServerDC01.contoso.com to ServerDC02.contoso.com completed successfully
  • Head back to ServerDC02
  • Start the KDC service
    • Set-Service KDC -StartupType Auto
    • Start-Service KDC
  • Wait until replication starts and completes

Replication can take a while to complete, depending on both the number of objects found in your Active Directory, as well as the latency between the two Domain Controllers.

Once completed, you should see the following from repadmin.

C:\WINDOWS\system32>repadmin /replsummary
Replication Summary Start Time: 2023-06-28 09:56:59

Beginning data collection for replication summary, this may take awhile:
  .....


Source DSA          largest delta    fails/total %%   error
 ServerDC01                  10m:41s    0 /   5    0
 ServerDC02                  03m:07s    0 /   5    0


Destination DSA     largest delta    fails/total %%   error
 ServerDC01                  03m:07s    0 /   5    0
 ServerDC02                  10m:41s    0 /   5    0

Hope you found this guide useful.

By Jostein Elvaker Haande

"A free society is a society where it is safe to be unpopular" - Adlai Stevenson

Leave a Reply

Your email address will not be published. Required fields are marked *