unable to configure raft service;no dRPC client set (data plane not started?)


21960347@...
 

Hello DAOS developers and users

I am trying to start a DAOS server with real SCM and NVme Devices.

After formating the storage device,the server complain "instance 0 exited: failed to start system db: unable to configure raft service: invalid database" and "no dRPC client set (data plane not started?)"

I've pasted the daos_server.log and .yaml below,thanks

localhost.localdomain INFO 2021/08/23 16:17:11 DAOS Control Server v1.2 (pid 60280) listening on 0.0.0.0:10001
localhost.localdomain INFO 2021/08/23 16:17:11 Checking DAOS I/O Engine instance 0 storage ...
localhost.localdomain INFO 2021/08/23 16:17:13 Metadata format required on instance 0
localhost.localdomain INFO 2021/08/23 16:17:57 Formatting scm storage for DAOS I/O Engine instance 0 (reformat: true)
localhost.localdomain INFO 2021/08/23 16:17:57 Instance 0: starting format of SCM (dcpm:/mnt/daos1)
localhost.localdomain INFO 2021/08/23 16:18:00 Instance 0: finished format of SCM (dcpm:/mnt/daos1)
localhost.localdomain INFO 2021/08/23 16:18:00 Formatting nvme storage for DAOS I/O Engine instance 0
localhost.localdomain INFO 2021/08/23 16:18:00 Instance 0: starting format of nvme block devices [0000:87:00.0 0000:88:00.0 0000:da:00.0 0000:db:00.0]
localhost.localdomain INFO 2021/08/23 16:18:08 Instance 0: finished format of nvme block devices [0000:87:00.0 0000:88:00.0 0000:da:00.0 0000:db:00.0]
localhost.localdomain INFO 2021/08/23 16:18:08 DAOS I/O Engine instance 0 storage ready
localhost.localdomain INFO 2021/08/23 16:18:08 instance 0 exited: failed to start system db: unable to configure raft service: invalid database
localhost.localdomain ERROR 2021/08/23 16:18:08 removing socket file: removing instance 0 socket file: no dRPC client set (data plane not started?)
localhost.localdomain INFO 2021/08/23 16:18:08 &&& RAS EVENT id: [engine_status_down] ts: [2021-08-23T16:18:08.508477+0800] host: [localhost.localdomain] type: [STATE_CHANGE] sev: [ERROR] msg: [DAOS rank exited unexpectedly] pid: [60280]





# For a single-server system
 
name: daos_server
access_points: ['localhost']
provider: ofi+sockets
nr_hugepages: 4096
control_log_file: /tmp/daos_server.log
transport_config:
   allow_insecure: true


engines:
-
  targets: 1
  pinned_numa_node: 0
  nr_xs_helpers: 0
  fabric_iface: eno1
  fabric_iface_port: 31416
  log_file: /tmp/daos_engine.0.log
 
  env_vars:
  - DAOS_MD_CAP=1024
  - CRT_CTX_SHARE_ADDR=0
  - CRT_TIMEOUT=30
  - FI_SOCKETS_MAX_CONN_RETRY=1
  - FI_SOCKETS_CONN_TIMEOUT=2000

  scm_mount: /mnt/daos1 # map to -s /mnt/daos
  scm_class: dcpm
  scm_list: [/dev/pmem0]
 
  bdev_class: nvme
  bdev_list: ["0000:87:00.0","0000:88:00.0","0000:da:00.0","0000:db:00.0"]