diff options
| author | Amar Tumballi <amarts@redhat.com> | 2018-10-16 16:31:49 +0530 | 
|---|---|---|
| committer | Amar Tumballi <amarts@redhat.com> | 2018-12-14 17:34:28 +0000 | 
| commit | d49b41e817d592c1904b6f01716df6546dad3ebe (patch) | |
| tree | 12546ad6da0534683cb7bed5b829500ecdfafe93 /glusterfsd/src | |
| parent | fc74ef85e0780e0a265275df00e4d0d4a2d05eab (diff) | |
fuse: add --lru-limit option
The inode LRU mechanism is moot in fuse xlator (ie. there is no
limit for the LRU list), as fuse inodes are referenced from
kernel context, and thus they can only be dropped on request of
the kernel. This might results in a high number of passive
inodes which are useless for the glusterfs client, causing a
significant memory overhead.
This change tries to remedy this by extending the LRU semantics
and allowing to set a finite limit on the fuse inode LRU.
A brief history of problem:
When gluster's inode table was designed, fuse didn't have any
'invalidate' method, which means, userspace application could
never ask kernel to send a 'forget()' fop, instead had to wait
for kernel to send it based on kernel's parameters. Inode table
remembers the number of times kernel has cached the inode based
on the 'nlookup' parameter. And 'nlookup' field is not used by
no other entry points (like server-protocol, gfapi etc).
Hence the inode_table of fuse module always has to have lru-limit
as '0', which means no limit. GlusterFS always had to keep all
inodes in memory as kernel would have had a reference to it.
Again, the reason for this is, kernel's glusterfs inode reference
was pointer of 'inode_t' structure in glusterfs. As it is a
pointer, we could never free it (to prevent segfault, or memory
corruption).
Solution:
In the inode table, handle the prune case of inodes with 'nlookup'
differently, and call a 'invalidator' method, which in this case is
fuse_invalidate(), and it sends the request to kernel for getting
the forget request.
When the kernel sends the forget, it means, it has dropped all
the reference to the inode, and it will send the forget with the
'nlookup' parameter too. We just need to make sure to reduce the
'nlookup' value we have when we get forget. That automatically
cause the relevant prune to happen.
Credits: Csaba Henk, Xavier Hernandez, Raghavendra Gowdappa, Nithya B
fixes: bz#1560969
Change-Id: Ifee0737b23b12b1426c224ec5b8f591f487d83a2
Signed-off-by: Amar Tumballi <amarts@redhat.com>
Diffstat (limited to 'glusterfsd/src')
| -rw-r--r-- | glusterfsd/src/glusterfsd.c | 24 | ||||
| -rw-r--r-- | glusterfsd/src/glusterfsd.h | 1 | 
2 files changed, 25 insertions, 0 deletions
| diff --git a/glusterfsd/src/glusterfsd.c b/glusterfsd/src/glusterfsd.c index 6347941f369..0dea52b6af1 100644 --- a/glusterfsd/src/glusterfsd.c +++ b/glusterfsd/src/glusterfsd.c @@ -219,6 +219,9 @@ static struct argp_option gf_options[] = {       "[default: 300]"},      {"resolve-gids", ARGP_RESOLVE_GIDS_KEY, 0, 0,       "Resolve all auxiliary groups in fuse translator (max 32 otherwise)"}, +    {"lru-limit", ARGP_FUSE_LRU_LIMIT_KEY, "N", 0, +     "Set fuse module's limit for number of inodes kept in LRU list to N " +     "[default: 0]"},      {"background-qlen", ARGP_FUSE_BACKGROUND_QLEN_KEY, "N", 0,       "Set fuse module's background queue length to N "       "[default: 64]"}, @@ -496,6 +499,15 @@ set_fuse_mount_options(glusterfs_ctx_t *ctx, dict_t *options)          }      } +    if (cmd_args->lru_limit >= 0) { +        ret = dict_set_int32(options, "lru-limit", cmd_args->lru_limit); +        if (ret < 0) { +            gf_msg("glusterfsd", GF_LOG_ERROR, 0, glusterfsd_msg_4, +                   "lru-limit"); +            goto err; +        } +    } +      if (cmd_args->background_qlen) {          ret = dict_set_int32(options, "background-qlen",                               cmd_args->background_qlen); @@ -1257,6 +1269,13 @@ parse_opts(int key, char *arg, struct argp_state *state)              cmd_args->resolve_gids = 1;              break; +        case ARGP_FUSE_LRU_LIMIT_KEY: +            if (!gf_string2int32(arg, &cmd_args->lru_limit)) +                break; + +            argp_failure(state, -1, 0, "unknown LRU limit option %s", arg); +            break; +          case ARGP_FUSE_BACKGROUND_QLEN_KEY:              if (!gf_string2int(arg, &cmd_args->background_qlen))                  break; @@ -2085,6 +2104,11 @@ parse_cmdline(int argc, char *argv[], glusterfs_ctx_t *ctx)          ctx->ssl_cert_depth = glusterfs_read_secure_access_file();      } +    /* Need to set lru_limit to below 0 to indicate there was nothing +       specified. This is needed as 0 is a valid option, and may not be +       default value. */ +    cmd_args->lru_limit = -1; +      argp_parse(&argp, argc, argv, ARGP_IN_ORDER, NULL, cmd_args);      if (cmd_args->print_xlatordir || cmd_args->print_statedumpdir || diff --git a/glusterfsd/src/glusterfsd.h b/glusterfsd/src/glusterfsd.h index 0042054e138..86ac61c1a92 100644 --- a/glusterfsd/src/glusterfsd.h +++ b/glusterfsd/src/glusterfsd.h @@ -109,6 +109,7 @@ enum argp_option_keys {      ARGP_ATTR_TIMES_GRANULARITY_KEY = 187,      ARGP_PRINT_LIBEXECDIR_KEY = 188,      ARGP_FUSE_FLUSH_HANDLE_INTERRUPT_KEY = 189, +    ARGP_FUSE_LRU_LIMIT_KEY = 190,  };  struct _gfd_vol_top_priv { | 
