diff options
| author | Xavier Hernandez <xhernandez@datalab.es> | 2014-11-07 12:12:19 +0100 | 
|---|---|---|
| committer | Vijay Bellur <vbellur@redhat.com> | 2014-12-04 11:34:38 -0800 | 
| commit | bc91dd4de39ffd481a52b837f322f6782c14e9f1 (patch) | |
| tree | a7a550c659bdee92b5d29839f87f5473d0b367bc /xlators/cluster/ec/src/ec.c | |
| parent | 7319b01ffa3d4ff7b1405873c8caeaf8a1f7b5d6 (diff) | |
ec: Fix self-healing issues.
Three problems have been detected:
1. Self healing is executed in background, allowing the fop that
   detected the problem to continue without blocks nor delays.
   While this is quite interesting to avoid unnecessary delays,
   it can cause spurious failures of self-heal because it may
   try to recover a file inside a directory that a previous
   self-heal has not recovered yet, causing the file self-heal
   to fail.
2. When a partial self-heal is being executed on a directory,
   if a full self-heal is attempted, it won't be executed
   because another self-heal is already in process, so the
   directory won't be fully repaired.
3. Information contained in loc's of some fop's is not enough
   to do a complete self-heal.
To solve these problems, I've made some changes:
* Improved ec_loc_from_loc() to add all available information
  to a loc.
* Before healing an entry, it's parent is checked and partially
  healed if necessary to avoid failures.
* All heal requests received for the same inode while another
  self-heal is being processed are queued. When the first heal
  completes, all pending requests are answered using the results
  of the first heal (without full execution), unless the first
  heal was a partial heal. In this case all partial heals are
  answered, and the first full heal is processed normally.
* An special virtual xattr (not physically stored on bricks)
  named 'trusted.ec.heal' has been created to allow synchronous
  self-heal of files.
  Now, the recommended way to heal an entire volume is this:
    find <mount> -d -exec getfattr -h -n trusted.ec.heal {} \;
Some minor changes:
* ec_loc_prepare() has been renamed to ec_loc_update().
* All loc management functions return 0 on success and -1 on
  error.
* Do not delay fop unlocks if heal is needed.
* Added basic ec xattrs initially on create, mkdir and mknod
  fops.
* Some coding style changes
Change-Id: I2a5fd9c57349a153710880d6ac4b1fa0c1475985
BUG: 1161588
Signed-off-by: Xavier Hernandez <xhernandez@datalab.es>
Reviewed-on: http://review.gluster.org/9072
Tested-by: Gluster Build System <jenkins@build.gluster.com>
Reviewed-by: Dan Lambright <dlambrig@redhat.com>
Diffstat (limited to 'xlators/cluster/ec/src/ec.c')
0 files changed, 0 insertions, 0 deletions
