Re: Message looks serious?


Wang, Di
 

Hello, 

Thanks.  This does show the connection failed with –svc 40.  I am not sure why it does not output any failure messages. But I do see others also complained about zero failure message.  Which version are you using 0.9 or master?

Btw: these server log might be seen during pool creation as well.  It is a known issue, and we will fix it. Thanks.

"05/13-11:57:11.02 delphi-006 DAOS[26509/26552] pool WARN src/pool/srv_target.c:1020 ds_pool_tgt_map_update() Ignore update pool a68b3845 1 -> 1


Thanks

Wangdi

From: <daos@daos.groups.io> on behalf of Colin Ngam <colin.ngam@...>
Reply-To: "daos@daos.groups.io" <daos@daos.groups.io>
Date: Thursday, May 14, 2020 at 8:06 AM
To: "daos@daos.groups.io" <daos@daos.groups.io>
Subject: Re: [daos] Message looks serious?

Hi WangDi,

 

Is this what you need:

 

05/14-09:53:00.86 hl-d106 DAOS[20928/20928] fi   INFO src/cart/src/gurt/fault_inject.c:486 d_fault_inject_init() No config file, fault injection is OFF.

05/14-09:53:00.86 hl-d106 DAOS[20928/20928] crt  INFO src/cart/src/cart/crt_init.c:282 crt_init_opt() libcart version 4.7.0 initializing

05/14-09:53:00.86 hl-d106 DAOS[20928/20928] crt  WARN src/cart/src/cart/crt_init.c:174 data_init() FI_UNIVERSE_SIZE was not set; setting to 2048

05/14-09:53:00.86 hl-d106 DAOS[20928/20928] crt  WARN src/cart/src/cart/crt_init.c:393 crt_init_opt() FI_OFI_RXM_USE_SRX not set, set=1

05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:142 cmd_args_print()   DAOS system name: daos_server

05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:143 cmd_args_print()   pool UUID: 6eb32fb0-49e9-49fd-96e8-bba14728a8c3

05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:144 cmd_args_print()   cont UUID: 00000000-0000-0000-0000-000000000000

05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:148 cmd_args_print()   pool svc: parsed 1 ranks from input 1

05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:152 cmd_args_print()   attr: name=NULL, value=NULL

05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:156 cmd_args_print()   path=NULL, type=unknown, oclass=UNKNOWN, chunk_size=0

05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:162 cmd_args_print()   snapshot: name=NULL, epoch=0, epoch range=NULL (0-0)

05/14-09:53:01.23 hl-d106 DAOS[20928/20928] client INFO src/utils/daos.c:163 cmd_args_print()   oid: 0.0

05/14-09:53:01.31 hl-d106 DAOS[20928/20928] daos INFO src/common/drpc.c:664 drpc_close() Closing dRPC socket fd=19

05/14-09:53:01.35 hl-d106 DAOS[20928/20928] daos INFO src/common/drpc.c:664 drpc_close() Closing dRPC socket fd=19

05/14-09:53:01.40 hl-d106 DAOS[20928/20928] common ERR  src/common/rsvc.c:142 rsvc_client_process_error() removed rank 1 from replica list due to DER_NOTREPLICA(-2020)

05/14-09:53:01.40 hl-d106 DAOS[20928/20928] common WARN src/common/rsvc.c:102 rsvc_client_choose() replica list empty

05/14-09:53:01.40 hl-d106 DAOS[20928/20928] pool ERR  src/pool/cli.c:539 dc_pool_connect() 6eb32fb0: cannot find pool service: DER_NOTREPLICA(-2020)

failed to connect to pool: -1005

 

This comes from the daos.log on the Client.

 

Thanks.

 

Colin

 

From: <daos@daos.groups.io> on behalf of "Wang, Di" <di.wang@...>
Reply-To: "daos@daos.groups.io" <daos@daos.groups.io>
Date: Wednesday, May 13, 2020 at 4:07 PM
To: "daos@daos.groups.io" <daos@daos.groups.io>
Subject: Re: [daos] Message looks serious?

 

If 40 does not exist, it should not be able to connect to the pool at all, I.e. it should output sth like "failed to connect to pool: …”.

 

These logs seems suggesting the pool connection did happen.  Would you please collect the client side daos log? (by "export D_LOG_FILE=xxx “? It might tell us what really happened. Thanks.

 

Thanks

WangDi

From: <daos@daos.groups.io> on behalf of Colin Ngam <cngam@...>
Reply-To: "daos@daos.groups.io" <daos@daos.groups.io>
Date: Wednesday, May 13, 2020 at 10:08 AM
To: "daos@daos.groups.io" <daos@daos.groups.io>
Subject: [daos] Message looks serious?

 

Greetings,

 

Executing the command:

daos pool list-cont --pool a68b3845-fe78-481e-aa84-164e851d5f52 --svc 40

 

Note that 40 does not exist.

 

We did not get an error from the daos command.

 

In the log:

 

05/13-11:57:11.02 delphi-006 DAOS[26509/26552] pool WARN src/pool/srv_target.c:1020 ds_pool_tgt_map_update() Ignore update pool a68b3845 1 -> 1

05/13-11:57:11.02 delphi-006 DAOS[26509/26552] pool WARN src/pool/srv_target.c:1020 ds_pool_tgt_map_update() Ignore update pool a68b3845 1 -> 1

05/13-11:57:11.02 delphi-006 DAOS[26509/26552] pool WARN src/pool/srv_target.c:1020 ds_pool_tgt_map_update() Ignore update pool a68b3845 1 -> 1

 

My guess is that ds_pool_tgt_map_update() should not even be called?

 

Cheers,

 

Colin

 

Join daos@daos.groups.io to automatically receive all group messages.