Some issues that have cropped up since gluster was installed.
Slow writes to disk. These come in two categories
Read permission has apparently been lost for some files. This does not necessarily show up in the output of "ls".
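One way to spot the mismatch described above is to compare what the mode bits claim with whether a file can actually be opened. The following is only a minimal sketch: the function name read_mismatches is made up for illustration, and a hit can also be legitimate (for example, a file readable only by a group the current user is not in), so results need to be checked by hand.

```python
import os
import stat


def read_mismatches(directory):
    """Return names of regular files whose mode bits grant read access to
    someone, but which the current user cannot actually open for reading."""
    mismatched = []
    for entry in sorted(os.listdir(directory)):
        path = os.path.join(directory, entry)
        st = os.lstat(path)
        if not stat.S_ISREG(st.st_mode):
            continue  # skip directories, symlinks, devices, etc.
        any_read_bit = stat.S_IRUSR | stat.S_IRGRP | stat.S_IROTH
        mode_says_readable = bool(st.st_mode & any_read_bit)
        try:
            with open(path, "rb") as handle:
                handle.read(1)  # force a real read, not just an open
            actually_readable = True
        except OSError:
            actually_readable = False
        if mode_says_readable and not actually_readable:
            mismatched.append(entry)
    return mismatched
```

Calling read_mismatches on a directory on the gluster mount lists candidate files; it does not change anything on disk.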
The read-permission issue seems to be caused by 0-byte meta-pointers left behind (see Gluster Community Chat Archive Logs, search on "sticky"). Logging onto each of mseas-data, nas-0-0 and nas-0-1 and deleting the pointer files we run across (ls -lhat | grep "\-\-\-\-\-\-\-\-\-T" ) clears this up, but isn't a long-term solution. For an example of this on our systems, see the gmeta files in /projects/philex/PE/2011/Jan09/arch75:
    ls -lh gmeta*
    -rw-rw-r-- 1 phaley philex  94M Feb 11 16:11 gmeta_ccnt_arch75
    -rw-rw-r-- 1 phaley philex  16M Feb 11 16:17 gmeta_ccntW0_arch75
    -rw-rw-r-- 1 phaley philex 9.3M Feb 11 16:15 gmeta_ccntW_arch75
    -rw-rw-r-- 1 phaley philex  36M Feb 11 16:14 gmeta_dccnt_arch75

    file gmeta*
    gmeta_ccnt_arch75:   writable, regular file, no read permission
    gmeta_ccntW0_arch75: writable, regular file, no read permission
    gmeta_ccntW_arch75:  writable, regular file, no read permission
    gmeta_dccnt_arch75:  data
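The manual search for leftover pointer files (zero-byte files that "ls -l" shows as ---------T) could be scripted roughly as below. This is a sketch under assumptions: the function name find_sticky_pointers is hypothetical, the directory to scan on each server has to be supplied by whoever runs it, and the script only lists candidates rather than deleting them.

```python
import os
import stat


def find_sticky_pointers(root):
    """Return paths of zero-byte regular files whose permission bits are
    exactly the sticky bit, i.e. what 'ls -l' prints as ---------T."""
    hits = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            st = os.lstat(path)
            if (stat.S_ISREG(st.st_mode)
                    and st.st_size == 0
                    and stat.S_IMODE(st.st_mode) == stat.S_ISVTX):
                hits.append(path)
    return sorted(hits)
```

Reviewing the returned list before removing anything keeps the cleanup as cautious as the by-hand approach, while covering a whole directory tree in one pass.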
Inability to re-install compute nodes. Need to test this on an otherwise healthy node when Greg is around.
When nas-0-0 required rebooting, the 10GbE card came up as eth2, not eth3. This confused gluster. Why did this happen? Can it be prevented from happening again? Is nas-0-1 vulnerable to the same thing?
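Until the cause is known, a check after each reboot could at least flag the renaming before gluster trips over it. The sketch below reads each interface's MAC address from Linux sysfs (/sys/class/net/NAME/address) and compares it with a recorded table. The function names and the MAC address in the usage note are placeholders, not values from our machines.

```python
import os


def interface_macs(sysfs_net="/sys/class/net"):
    """Map each network interface name to the MAC address sysfs reports."""
    macs = {}
    for name in sorted(os.listdir(sysfs_net)):
        address_file = os.path.join(sysfs_net, name, "address")
        try:
            with open(address_file) as handle:
                macs[name] = handle.read().strip().lower()
        except OSError:
            continue  # entry without a readable address file
    return macs


def misnamed_interfaces(expected, actual):
    """Return {name: (expected_mac, actual_mac_or_None)} for every expected
    interface whose current MAC differs from the recorded one."""
    problems = {}
    for name, mac in expected.items():
        found = actual.get(name)
        if found != mac.lower():
            problems[name] = (mac.lower(), found)
    return problems
```

For example, misnamed_interfaces({"eth3": "00:11:22:33:44:55"}, interface_macs()) returns an empty dict when the card recorded as eth3 is still eth3, and a non-empty one when the name has moved.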
Intermittent failure of SGE jobs to start on compute nodes. This seems to happen on compute nodes that can't find my home area. Rebooting may help, but the fix may not last. Need to see if reinstallation works.
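To narrow down which nodes are affected, a small check for the home area could be run on each compute node. This is a sketch only: the function name home_area_ok is made up, and a hung mount could make the call block rather than fail, which this does not guard against.

```python
import os


def home_area_ok(path):
    """Return True if path is a directory whose contents can be listed."""
    try:
        os.listdir(path)
        return True
    except OSError:
        return False
```

Running home_area_ok(os.path.expanduser("~")) on a node reports whether the current user's home area is visible there.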
PFJL has noticed that glusterfs averages around 10% CPU/memory in top on mseas. Is this normal? Note that 10% memory on mseas may translate to a larger percentage on a compute node.
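The concern about compute nodes is simple arithmetic: if glusterfs used the same absolute amount of memory on a machine with less RAM, its percentage would be higher. The sketch below shows the conversion. The memory sizes in the usage note are hypothetical, and the assumption that usage stays the same in absolute terms has not been verified.

```python
def percent_on_other_host(percent_on_source, source_total_gb, target_total_gb):
    """Convert a memory percentage seen on one host into the percentage the
    same absolute amount of memory would occupy on another host."""
    used_gb = percent_on_source / 100.0 * source_total_gb
    return used_gb / target_total_gb * 100.0
```

With made-up sizes of 32 GB on mseas and 8 GB on a compute node, percent_on_other_host(10, 32, 8) comes to about 40, i.e. four times the share seen on mseas.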
We should probably synchronize the clocks on nas-0-0 and nas-0-1 to the same time source that serves mseas and mseas-data.
Some issues that seem to have been solved since gluster was installed.
Issue seems to have been setroubleshootd grabbing massive amounts of memory and pushing mseas-data into swap.