When the network port recovers from a fault, the corresponding rank cannot receive the groupupdate message, resulting in the failure to join the group normally,rpc msg OOG.


dagouxiong2015@...
 

We tried the mercury demo and found that initializing hg can solve this OOG problem,
and then analyzed the code of the cart,
and found that the cart context applied a global context, and it is used by all other ranks rpc msg.

When we want to initialize the cart context for a rank, instead of all ranks,what can we do?




struct dss_module_info {
    crt_context_t       dmi_ctx;

Join daos@daos.groups.io to automatically receive all group messages.