Gluster geo-replication became faulty with OSError: [Errno 13] Permission denied

Hi,

I am new to Gluster and have a setup of master and slave volumes over a WAN. Each cluster has 3 nodes (2 replica, 1 arbiter).

For one of the volumes, the geo-replication status repeatedly goes into the Faulty state with the error below. The worker becomes Active for a few seconds but then goes Faulty again.

I did find a file with the gfid mentioned below in the .glusterfs folder of the volume brick. I have checked the file's permissions and ownership, and they are the same on both the master and slave sides.

Traceback (most recent call last):
  File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 204, in main
    main_i()
  File "/usr/libexec/glusterfs/python/syncdaemon/gsyncd.py", line 780, in main_i
    local.service_loop(*[r for r in [remote] if r])
  File "/usr/libexec/glusterfs/python/syncdaemon/resource.py", line 1584, in service_loop
    g1.crawlwrap(oneshot=True, register_time=register_time)
  File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 570, in crawlwrap
    self.crawl()
  File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 1293, in crawl
    self.process([item[1]], 0)
  File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 1002, in process
    self.process_change(change, done, retry)
  File "/usr/libexec/glusterfs/python/syncdaemon/master.py", line 943, in process_change
    failures = self.slave.server.entry_ops(entries)
  File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 226, in call
    return self.ins(self.meth, *a)
  File "/usr/libexec/glusterfs/python/syncdaemon/repce.py", line 208, in call
    raise res
OSError: [Errno 13] Permission denied: '.gfid/7090f3b8-2405-42d2-9061-3f0d5424acda'
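
For anyone who wants to repeat the permission check, here is a minimal sketch of how the gfid from the error can be mapped to its entry under .glusterfs on a brick, and how its mode and ownership can be printed for comparison between master and slave. It assumes the usual brick backend layout (.glusterfs/<first two hex characters>/<next two>/<gfid>). The brick path /bricks/brick1 and the helper names gfid_backend_path and describe are placeholders for illustration, not part of Gluster itself.

```python
import os
import stat
import uuid


def gfid_backend_path(brick_root, gfid):
    """Build the expected path of a gfid entry under <brick>/.glusterfs.

    Assumes the layout .glusterfs/<aa>/<bb>/<gfid>, where aa and bb are the
    first and second pairs of hex characters of the gfid.
    """
    # uuid.UUID validates the string and normalises it to lowercase.
    canonical = str(uuid.UUID(gfid))
    return os.path.join(
        brick_root, ".glusterfs", canonical[0:2], canonical[2:4], canonical
    )


def describe(path):
    """Return (mode string, uid, gid) for path without following symlinks.

    lstat is used because gfid entries for directories are symlinks, while
    entries for regular files are hard links.
    """
    st = os.lstat(path)
    return stat.filemode(st.st_mode), st.st_uid, st.st_gid


if __name__ == "__main__":
    brick = "/bricks/brick1"  # hypothetical brick path; substitute your own
    path = gfid_backend_path(brick, "7090f3b8-2405-42d2-9061-3f0d5424acda")
    print(path)
    if os.path.lexists(path):
        print(describe(path))
    else:
        print("no such entry on this brick")
```

Running this on the corresponding brick of each cluster and comparing the two outputs is one way to confirm whether mode, uid and gid really match on both sides. Numeric uid/gid values are printed on purpose, since user names can map to different ids on different nodes.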