POSIX container create do not wok


timehuang88@...
 

HI Guys:
As shown below, when I try to create container with POSIX type, the admin terminal blocked there(top screenshot). another terminal lauched the DAOS cluster with clush cmd just showed that creating  and closing log info(bottom screenshot).
I also tried to create container with HDF5 type, it do work well. 
I don`t known what`s going on there and why  POSIX type container failed to be created.
need help!







Nabarro, Tom
 

You probably need the server log from the access point rank (/tmp/server0.log default setting in server config) to help debug the issue.

 

Tom

 

From: daos@daos.groups.io <daos@daos.groups.io> On Behalf Of timehuang88@...
Sent: Tuesday, May 19, 2020 10:48 AM
To: daos@daos.groups.io
Subject: [daos] POSIX container create do not wok

 

HI Guys:
As shown below, when I try to create container with POSIX type, the admin terminal blocked there(top screenshot). another terminal lauched the DAOS cluster with clush cmd just showed that creating  and closing log info(bottom screenshot).
I also tried to create container with HDF5 type, it do work well. 
I don`t known what`s going on there and why  POSIX type container failed to be created.
need help!






---------------------------------------------------------------------
Intel Corporation (UK) Limited
Registered No. 1134945 (England)
Registered Office: Pipers Way, Swindon SN3 1RJ
VAT No: 860 2173 47

This e-mail and any attachments may contain confidential material for
the sole use of the intended recipient(s). Any review or distribution
by others is strictly prohibited. If you are not the intended
recipient, please contact the sender and delete all copies.


Colin Ngam
 

Greetings,

 

Does it work with –svc 1?

 

Thanks.

 

Colin

 

From: <daos@daos.groups.io> on behalf of "Nabarro, Tom" <tom.nabarro@...>
Reply-To: "daos@daos.groups.io" <daos@daos.groups.io>
Date: Tuesday, May 19, 2020 at 5:01 AM
To: "daos@daos.groups.io" <daos@daos.groups.io>
Subject: Re: [daos] POSIX container create do not wok

 

You probably need the server log from the access point rank (/tmp/server0.log default setting in server config) to help debug the issue.

 

Tom

 

From: daos@daos.groups.io <daos@daos.groups.io> On Behalf Of timehuang88@...
Sent: Tuesday, May 19, 2020 10:48 AM
To: daos@daos.groups.io
Subject: [daos] POSIX container create do not wok

 

HI Guys:
As shown below, when I try to create container with POSIX type, the admin terminal blocked there(top screenshot). another terminal lauched the DAOS cluster with clush cmd just showed that creating  and closing log info(bottom screenshot).
I also tried to create container with HDF5 type, it do work well. 
I don`t known what`s going on there and why  POSIX type container failed to be created.
need help!






---------------------------------------------------------------------
Intel Corporation (UK) Limited
Registered No. 1134945 (England)
Registered Office: Pipers Way, Swindon SN3 1RJ
VAT No: 860 2173 47

This e-mail and any attachments may contain confidential material for
the sole use of the intended recipient(s). Any review or distribution
by others is strictly prohibited. If you are not the intended
recipient, please contact the sender and delete all copies.


timehuang88@...
 

conent of /tmp/server0.log shown below:

05/20-11:02:04.67 snode2 DAOS[7483/7492] grp  ERR  src/cart/src/cart/crt_group.c:3177 crt_group_secondary_rank_add_internal() rank 0 is not part of associated primary group daos_server
05/20-11:02:04.67 snode2 DAOS[7483/7492] grp  ERR  src/cart/src/cart/crt_group.c:3645 crt_group_secondary_modify() Failure when adding rank 0, rc=-1019
05/20-11:02:04.67 snode2 DAOS[7483/7492] pool ERR  src/pool/srv_target.c:958 ds_pool_tgt_map_update() e8337c30: Can not update pool group: DER_OOG(-1019)
05/20-11:02:04.67 snode2 DAOS[7483/7492] rsvc ERR  src/rsvc/srv.c:475 rsvc_step_up_cb() e8337c30: failed to step up as leader 0: DER_OOG(-1019)
05/20-11:02:05.11 snode2 DAOS[7483/7492] grp  ERR  src/cart/src/cart/crt_group.c:3177 crt_group_secondary_rank_add_internal() rank 0 is not part of associated primary group daos_server
05/20-11:02:05.11 snode2 DAOS[7483/7492] grp  ERR  src/cart/src/cart/crt_group.c:3645 crt_group_secondary_modify() Failure when adding rank 0, rc=-1019
05/20-11:02:05.11 snode2 DAOS[7483/7492] pool ERR  src/pool/srv_target.c:958 ds_pool_tgt_map_update() e8337c30: Can not update pool group: DER_OOG(-1019)
05/20-11:02:05.11 snode2 DAOS[7483/7492] rsvc ERR  src/rsvc/srv.c:475 rsvc_step_up_cb() e8337c30: failed to step up as leader 0: DER_OOG(-1019)
05/20-11:02:05.11 snode2 DAOS[7483/7492] swim ERR  src/cart/src/swim/swim.c:743 swim_progress() SWIM shutdown
05/20-11:23:25.96 snode2 DAOS[9933/9942] swim ERR  src/cart/src/swim/swim.c:743 swim_progress() SWIM shutdown
05/20-15:05:57.02 snode2 DAOS[9933/9944] server EMRG src/iosrv/srv.c:544 dss_srv_handler() Assertion failed d_list_empty(&dx->dx_sleep_ult_list)
05/20-15:06:08.84 snode2 DAOS[10062/10071] grp  ERR  src/cart/src/cart/crt_group.c:3177 crt_group_secondary_rank_add_internal() rank 0 is not part of associated primary group daos_server
05/20-15:06:08.84 snode2 DAOS[10062/10071] grp  ERR  src/cart/src/cart/crt_group.c:3645 crt_group_secondary_modify() Failure when adding rank 0, rc=-1019
05/20-15:06:08.84 snode2 DAOS[10062/10071] pool ERR  src/pool/srv_target.c:958 ds_pool_tgt_map_update() 9e4f5899: Can not update pool group: DER_OOG(-1019)
05/20-15:06:08.84 snode2 DAOS[10062/10071] rsvc ERR  src/rsvc/srv.c:475 rsvc_step_up_cb() 9e4f5899: failed to step up as leader 0: DER_OOG(-1019)
05/20-15:06:08.84 snode2 DAOS[10062/10071] swim ERR  src/cart/src/swim/swim.c:743 swim_progress() SWIM shutdown
 
Maybe the swim_progress() SWIM shutdown give me some tips, maybe not. 
Another question, in the storage cluster, I have 3 nodes(snode1, snode2, snode3), and I set up snode1 as access point inside all the 3 nodes daos_server.yml file, when DAOS system start up, snode1 runs rank0, snode2 runs rank1, snode3 runs rank2.  Since snode1 is the access point, when I try to create container, should the --svc option be set as 0? Is that right?


timehuang88@...
 
Edited

Hi Colin, thx for your suggestion.
It does not work with --svc 1, actually I tried this cmd with 0,1,2 since the cluster has rank0-2. None of them work

 

 


timehuang88@...
 
Edited



before creating POSIX type container, should I run something else in daos server? or I missed something important?