Re: Message looks serious?
Colin Ngam
Greetings,
commit 8200a7fb403e091b51b4b00c1aec57dafefb1ada
Thanks.
Colin
From: <daos@daos.groups.io> on behalf of "Wang, Di" <di.wang@...>
Hello,
Thanks. This does show that the connection failed with --svc 40. I am not sure why it does not output any failure message, but I do see that others have also complained about missing failure messages. Which version are you using, 0.9 or master?
By the way, these server log messages may also be seen during pool creation. It is a known issue, and we will fix it. Thanks.
05/13-11:57:11.02 delphi-006 DAOS[26509/26552] pool WARN src/pool/srv_target.c:1020 ds_pool_tgt_map_update() Ignore update pool a68b3845 1 -> 1
Thanks
Wangdi
From: <daos@daos.groups.io> on behalf of Colin Ngam <colin.ngam@...>
Hi WangDi,
Is this what you need:
05/14-09:53:00.86 hl-d106 DAOS[20928/20928] fi INFO src/cart/src/gurt/fault_inject.c:486 d_fault_inject_init() No config file, fault injection is OFF.
05/14-09:53:00.86 hl-d106 DAOS[20928/20928] crt INFO src/cart/src/cart/crt_init.c:282 crt_init_opt() libcart version 4.7.0 initializing
05/14-09:53:00.86 hl-d106 DAOS[20928/20928] crt WARN src/cart/src/cart/crt_init.c:174 data_init() FI_UNIVERSE_SIZE was not set; setting to 2048
05/14-09:53:00.86 hl-d106 DAOS[20928/20928] crt WARN src/cart/src/cart/crt_init.c:393 crt_init_opt() FI_OFI_RXM_USE_SRX not set, set=1
05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:142 cmd_args_print() DAOS system name: daos_server
05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:143 cmd_args_print() pool UUID: 6eb32fb0-49e9-49fd-96e8-bba14728a8c3
05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:144 cmd_args_print() cont UUID: 00000000-0000-0000-0000-000000000000
05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:148 cmd_args_print() pool svc: parsed 1 ranks from input 1
05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:152 cmd_args_print() attr: name=NULL, value=NULL
05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:156 cmd_args_print() path=NULL, type=unknown, oclass=UNKNOWN, chunk_size=0
05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:162 cmd_args_print() snapshot: name=NULL, epoch=0, epoch range=NULL (0-0)
05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:163 cmd_args_print() oid: 0.0
05/14-09:53:01.31 hl-d106 DAOS[20928/20928] daos INFO src/common/drpc.c:664 drpc_close() Closing dRPC socket fd=19
05/14-09:53:01.35 hl-d106 DAOS[20928/20928] daos INFO src/common/drpc.c:664 drpc_close() Closing dRPC socket fd=19
05/14-09:53:01.40 hl-d106 DAOS[20928/20928] common ERR src/common/rsvc.c:142 rsvc_client_process_error() removed rank 1 from replica list due to DER_NOTREPLICA(-2020)
05/14-09:53:01.40 hl-d106 DAOS[20928/20928] common WARN src/common/rsvc.c:102 rsvc_client_choose() replica list empty
05/14-09:53:01.40 hl-d106 DAOS[20928/20928] pool ERR src/pool/cli.c:539 dc_pool_connect() 6eb32fb0: cannot find pool service: DER_NOTREPLICA(-2020)
failed to connect to pool: -1005
This comes from the daos.log on the Client.
Thanks.
Colin
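For anyone reading a client log like the one above, the ERR and WARN entries are the interesting ones. Below is a minimal Python sketch that pulls them out, assuming every entry starts with the "timestamp host DAOS[n/n] facility LEVEL file:line function()" header seen in this thread. The regular expression and the names LOG_RE, Entry and parse_entries are illustrative only and are not part of DAOS.

```python
import re
from typing import Iterator, NamedTuple

# Header layout inferred from the log lines quoted in this thread (an
# assumption, not an official DAOS format description).
LOG_RE = re.compile(
    r"(?P<time>\d{2}/\d{2}-\d{2}:\d{2}:\d{2}\.\d{2})\s+"
    r"(?P<host>\S+)\s+DAOS\[\d+/\d+\]\s+"
    r"(?P<facility>\S+)\s+(?P<level>[A-Z]+)\s+"
    r"(?P<src>\S+:\d+)\s+(?P<func>\w+)\(\)"
)


class Entry(NamedTuple):
    time: str
    host: str
    facility: str
    level: str
    src: str
    func: str
    message: str


def parse_entries(text: str) -> Iterator[Entry]:
    """Yield one Entry per log header found anywhere in text.

    Headers are located by position rather than by line, so entries that
    were pasted on one long line or wrapped mid-message still parse.
    """
    matches = list(LOG_RE.finditer(text))
    for i, m in enumerate(matches):
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        # Collapse whitespace so a message wrapped across lines is rejoined.
        message = " ".join(text[m.end():end].split())
        yield Entry(m["time"], m["host"], m["facility"], m["level"],
                    m["src"], m["func"], message)


sample = (
    "05/14-09:53:01.40 hl-d106 DAOS[20928/20928] common ERR src/common/rsvc.c:142 "
    "rsvc_client_process_error() removed rank 1 from replica list due \n"
    "to DER_NOTREPLICA(-2020) "
    "05/14-09:53:01.40 hl-d106 DAOS[20928/20928] common WARN src/common/rsvc.c:102 "
    "rsvc_client_choose() replica list empty"
)

problems = [e for e in parse_entries(sample) if e.level in ("ERR", "WARN")]
for e in problems:
    print(e.level, e.src, e.func, e.message)
```

Anything printed after the last log entry, such as the command's own "failed to connect to pool" output, ends up appended to the final entry's message in this sketch.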
From: <daos@daos.groups.io> on behalf of "Wang, Di" <di.wang@...>
If 40 does not exist, it should not be able to connect to the pool at all, i.e. it should output something like "failed to connect to pool: …".
These logs seem to suggest that the pool connection did happen. Would you please collect the client-side daos log (by "export D_LOG_FILE=xxx")? It might tell us what really happened. Thanks.
Thanks
WangDi
From: <daos@daos.groups.io> on behalf of Colin Ngam <cngam@...>
Greetings,
Executing the command: daos pool list-cont --pool a68b3845-fe78-481e-aa84-164e851d5f52 --svc 40
Note that 40 does not exist.
We did not get an error from the daos command.
In the log:
05/13-11:57:11.02 delphi-006 DAOS[26509/26552] pool WARN src/pool/srv_target.c:1020 ds_pool_tgt_map_update() Ignore update pool a68b3845 1 -> 1
05/13-11:57:11.02 delphi-006 DAOS[26509/26552] pool WARN src/pool/srv_target.c:1020 ds_pool_tgt_map_update() Ignore update pool a68b3845 1 -> 1
05/13-11:57:11.02 delphi-006 DAOS[26509/26552] pool WARN src/pool/srv_target.c:1020 ds_pool_tgt_map_update() Ignore update pool a68b3845 1 -> 1
My guess is that ds_pool_tgt_map_update() should not even be called?
Cheers,
Colin