1: HA sync is failing between the NetScaler devices. When doing a force sync, we get the error:
"command failed on secondary node but succeeded on primary node"
2: exec: show HA node
1) Node ID: 0
IP: 10.10.10.12
Node State: UP
Master State: Primary
Fail-Safe Mode: OFF
INC State: DISABLED
Sync State: ENABLED
<snip>
2) Node ID: 1
IP: 10.10.10.51
Node State: UP
Master State: Secondary
Fail-Safe Mode: OFF
INC State: DISABLED
Sync State: FAILED <<<<<<<<<<<<<look here
<snip>
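When many appliances need checking, the node blocks above can be parsed rather than read by eye. The following is a minimal sketch, assuming the output of the command has been captured as text in the "N) Node ID: X" layout shown above. The function names and the trimmed SAMPLE string are hypothetical and are not part of any NetScaler tooling.

```python
import re


def parse_ha_nodes(text):
    """Split captured 'show HA node' output into one dict per node."""
    nodes = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        # A new node block starts with a line such as "1) Node ID: 0".
        match = re.match(r"^\d+\)\s*Node ID:\s*(\d+)", line)
        if match:
            current = {"Node ID": match.group(1)}
            nodes.append(current)
            continue
        # Every other "Key: value" line belongs to the current node.
        if current is not None and ":" in line:
            key, _, value = line.partition(":")
            current[key.strip()] = value.strip()
    return nodes


def nodes_with_failed_sync(nodes):
    """Return the nodes whose Sync State begins with FAILED."""
    return [n for n in nodes if n.get("Sync State", "").upper().startswith("FAILED")]


# Hypothetical sample, trimmed from the output in step 2.
SAMPLE = """\
1) Node ID: 0
IP: 10.10.10.12
Node State: UP
Master State: Primary
Sync State: ENABLED
2) Node ID: 1
IP: 10.10.10.51
Node State: UP
Master State: Secondary
Sync State: FAILED
"""

if __name__ == "__main__":
    for node in nodes_with_failed_sync(parse_ha_nodes(SAMPLE)):
        print(node["IP"], node["Master State"], node["Sync State"])
```

The prefix match on FAILED is deliberate, so that a trailing annotation on the line does not hide the failure.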
3: Comparing the root directory listings of the primary and the secondary, we see a large difference between the two nodes:
Secondary:
more ../../shell/ls_lRtrp.out
total 17079
drwxr-xr-x 2 root wheel 512B Aug 25 2018 mnt/
drwxr-xr-x 2 root wheel 512B Aug 25 2018 home/
drwxrwxr-x 2 root 5 512B Aug 25 2018 .snap/
drwxr-xr-x 2 root wheel 1.0k Aug 25 2018 bin/
drwxr-xr-x 2 root wheel 2.0k Aug 25 2018 sbin/
drwxr-xr-x 2 root wheel 512B Aug 25 2018 libexec/
drwxr-xr-x 3 root wheel 2.0k Aug 25 2018 lib/
drwxr-xr-x 3 root wheel 512B Aug 25 2018 compat/
drwxr-xr-x 10 root wheel 512B Aug 25 2018 usr/
drwxr-xr-x 14 root wheel 5.0k Aug 25 2018 netscaler/
drwxr-xr-x 38 root wheel 1.0k Oct 4 2018 var/
dr-xr-xr-x 7 root wheel 512B Jul 27 22:05 dev/
lrwxr-xr-x 1 root wheel 16B Jul 27 22:05 nsconfig -> /flash/nsconfig/
drwxr-xr-x 260 root wheel 3.5k Jul 27 22:05 nscache/
lrwxr-xr-x 1 root wheel 33B Jul 27 22:05 optional -> /netscaler/portal/themes/optional
lrwxr-xr-x 1 root wheel 15B Jul 27 22:05 configdb -> /flash/configdb
lrwxr-xr-x 1 root wheel 33B Jul 27 22:05 colorful -> /netscaler/portal/themes/colorful
drwxr-xr-x 4 root wheel 512B Aug 8 01:44 root/
drwxr-xr-x 6 root wheel 512B Aug 12 11:40 flash/
drwxr-xr-x 9 root wheel 1.5k Aug 13 06:19 etc/
-rw-r--r-- 1 root wheel 16M Aug 13 06:32 (null) <<<<<<<<< this 16M file appears to be the problem; it is the main difference between the two nodes
drwxrwxrwt 3 root wheel 1.5k Aug 13 06:32 tmp/
dr-xr-xr-x 1 root wheel 0B Aug 13 06:32 proc/
Primary:
/shell/ls_lRtrp.out
total 41
drwxr-xr-x 2 root wheel 512B Aug 25 2018 mnt/
drwxr-xr-x 2 root wheel 512B Aug 25 2018 home/
drwxrwxr-x 2 root 5 512B Aug 25 2018 .snap/
drwxr-xr-x 2 root wheel 1.0k Aug 25 2018 bin/
drwxr-xr-x 2 root wheel 2.0k Aug 25 2018 sbin/
drwxr-xr-x 2 root wheel 512B Aug 25 2018 libexec/
drwxr-xr-x 3 root wheel 2.0k Aug 25 2018 lib/
drwxr-xr-x 3 root wheel 512B Aug 25 2018 compat/
drwxr-xr-x 10 root wheel 512B Aug 25 2018 usr/
drwxr-xr-x 14 root wheel 5.0k Aug 25 2018 netscaler/
drwxr-xr-x 38 root wheel 1.0k Oct 4 2018 var/
dr-xr-xr-x 7 root wheel 512B Jul 27 22:12 dev/
lrwxr-xr-x 1 root wheel 16B Jul 27 22:12 nsconfig -> /flash/nsconfig/
drwxr-xr-x 260 root wheel 3.5k Jul 27 22:12 nscache/
lrwxr-xr-x 1 root wheel 33B Jul 27 22:12 optional -> /netscaler/portal/themes/optional
lrwxr-xr-x 1 root wheel 15B Jul 27 22:12 configdb -> /flash/configdb
lrwxr-xr-x 1 root wheel 33B Jul 27 22:12 colorful -> /netscaler/portal/themes/colorful
-rw-r--r-- 1 root wheel 2k Jul 27 22:13 (null) <<<<<<<<<<<<<look here
drwxr-xr-x 4 root wheel 512B Aug 8 01:45 root/
drwxr-xr-x 6 root wheel 512B Aug 12 11:40 flash/
drwxr-xr-x 9 root wheel 1.5k Aug 13 06:30 etc/
drwxrwxrwt 3 root wheel 1.5k Aug 13 06:34 tmp/
dr-xr-xr-x 1 root wheel 0B Aug 13 06:34 proc/
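The comparison in step 3 can be automated by picking out regular files in the root listing and converting their sizes to bytes. This is a rough sketch, assuming the nine-column layout shown above with human-readable sizes (512B, 1.0k, 16M). The helper names, the 1 MiB threshold, and the shortened sample listings are illustrative assumptions only.

```python
UNITS = {"B": 1, "K": 1024, "M": 1024 ** 2, "G": 1024 ** 3, "T": 1024 ** 4}


def parse_size(text):
    """Convert a human-readable size such as '512B', '1.0k' or '16M' to bytes."""
    unit = text[-1].upper()
    if unit in UNITS:
        return int(float(text[:-1]) * UNITS[unit])
    return int(text)  # plain byte count with no suffix


def regular_files(listing):
    """Yield (name, size_in_bytes) for each regular file in the listing."""
    for line in listing.splitlines():
        # Columns: perms, links, owner, group, size, month, day, time/year, name
        fields = line.split(None, 8)
        # Regular files start with '-'; directories ('d') and symlinks ('l') are skipped.
        if len(fields) == 9 and fields[0].startswith("-"):
            yield fields[8], parse_size(fields[4])


def large_files(listing, limit=1024 ** 2):
    """Return regular files larger than limit bytes (assumed threshold: 1 MiB)."""
    return [(name, size) for name, size in regular_files(listing) if size > limit]


# Hypothetical samples, shortened from the listings in step 3.
SECONDARY = """\
drwxr-xr-x 9 root wheel 1.5k Aug 13 06:19 etc/
-rw-r--r-- 1 root wheel 16M Aug 13 06:32 (null)
drwxrwxrwt 3 root wheel 1.5k Aug 13 06:32 tmp/
"""
PRIMARY = """\
lrwxr-xr-x 1 root wheel 15B Jul 27 22:12 configdb -> /flash/configdb
-rw-r--r-- 1 root wheel 2k Jul 27 22:13 (null)
drwxr-xr-x 4 root wheel 512B Aug 8 01:45 root/
"""

if __name__ == "__main__":
    print("secondary:", large_files(SECONDARY))
    print("primary:  ", large_files(PRIMARY))
```

With these samples, only the (null) file on the secondary exceeds the threshold. The same file on the primary is about 2k and is not reported.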
4: We see a large number of the following entries in the ‘messages’ file:
Aug 13 06:32:40 <local0.alert> 10.10.140.51 08/13/2019:06:32:40 GMT VMP01-HS 0-PPE-0 : default EVENT STATECHANGE 202 0 : Device "self node 10.10.140.51" - State "SYNC Failure - Save remote config failed"
Aug 13 06:32:41 <local0.err> VMP01-HS nssync: NSAPI_AUTOSYNC_FILES issued failed: Synchronization failed, please try again
Aug 13 06:32:45 <kern.err> VMP01-HS kernel: pid 96836 (rsync), uid 0 inumber 2517 on /: filesystem full
Aug 13 06:32:46 <kern.err> VMP01-HS kernel: pid 96849 (nscli), uid 0 inumber 42200 on /: filesystem full
So rsync was failing, and ramdisk usage kept increasing because of this (null) file.
The auth.log on the primary indicated that nsroot was unable to log in.
The logs indicate that sync is failing due to a disk usage issue on the ramdisk (md0). The internal nsuser is not able to log in, and this is causing the sync failures on the secondary appliance.
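To confirm which processes are hitting the full filesystem, the kernel entries from step 4 can be tallied. The sketch below assumes the "pid N (name), uid N inumber N on MOUNT: filesystem full" wording seen in the log lines above. The function name and the sample list are hypothetical.

```python
import re
from collections import Counter

# Matches kernel lines such as:
#   kernel: pid 96836 (rsync), uid 0 inumber 2517 on /: filesystem full
FULL_RE = re.compile(
    r"pid (\d+) \(([^)]+)\), uid \d+ inumber \d+ on (\S+): filesystem full"
)


def filesystem_full_by_process(lines):
    """Count 'filesystem full' events per (process name, mount point)."""
    counts = Counter()
    for line in lines:
        match = FULL_RE.search(line)
        if match:
            counts[(match.group(2), match.group(3))] += 1
    return counts


# Hypothetical sample, copied from the entries in step 4.
SAMPLE_LINES = [
    "Aug 13 06:32:41 <local0.err> VMP01-HS nssync: NSAPI_AUTOSYNC_FILES issued failed: Synchronization failed, please try again",
    "Aug 13 06:32:45 <kern.err> VMP01-HS kernel: pid 96836 (rsync), uid 0 inumber 2517 on /: filesystem full",
    "Aug 13 06:32:46 <kern.err> VMP01-HS kernel: pid 96849 (nscli), uid 0 inumber 42200 on /: filesystem full",
]

if __name__ == "__main__":
    for (process, mount), count in filesystem_full_by_process(SAMPLE_LINES).items():
        print(f"{process} on {mount}: {count}")
```

On a real appliance the lines would come from the messages file rather than a hard-coded list. Reading that file is left out here because its location and rotation scheme are not shown in the notes above.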