I am running a 3-node NodeWeaver test cluster, with all nodes under KVM on the same Ubuntu 18.04 machine, and I am hitting problems trying to deploy the Ubuntu 18.04 VM from NodeWeaver’s app store.
The version is NodeWeaver 11.5 build 2002080133.
Any suggestions welcome!
The current error in the VM log is:
21:42:06 2020 [Z0][VM][I]: New state is PENDING
21:42:14 2020 [Z0][VM][I]: New state is ACTIVE
21:42:14 2020 [Z0][VM][I]: New LCM state is PROLOG
21:42:15 2020 [Z0][VM][I]: New LCM state is BOOT
21:42:15 2020 [Z0][VMM][I]: Generating deployment file: /var/lib/one/vms/3/deployment.0
21:42:15 2020 [Z0][VMM][I]: Successfully execute transfer manager driver operation: tm_context.
21:42:15 2020 [Z0][VMM][I]: ExitCode: 0
21:42:15 2020 [Z0][VMM][I]: Successfully execute network driver operation: pre.
21:42:18 2020 [Z0][VMM][I]: Command execution fail: cat << EOT | /var/tmp/one/vmm/kvm/deploy '/var/lib/one//datastores/0/3/deployment.0' 'node4745' 3 node4745
21:42:18 2020 [Z0][VMM][I]: error: Failed to create domain from /var/lib/one//datastores/0/3/deployment.0
21:42:18 2020 [Z0][VMM][I]: error: internal error: Child process (LC_ALL=C PATH=/sbin:/usr/sbin:/bin:/usr/bin /usr/bin/kvm -help) unexpected exit status 127: /usr/bin/kvm: error while loading shared libraries: librdmacm.so.1: cannot open shared object file: No such file or directory
21:42:18 2020 [Z0][VMM][I]:
21:42:18 2020 [Z0][VMM][E]: Could not create domain from /var/lib/one//datastores/0/3/deployment.0
21:42:18 2020 [Z0][VMM][I]: ExitCode: 255
21:42:18 2020 [Z0][VMM][I]: Failed to execute virtualization driver operation: deploy.
21:42:18 2020 [Z0][VMM][E]: Error deploying virtual machine: Could not create domain from /var/lib/one//datastores/0/3/deployment.0
21:42:18 2020 [Z0][VM][I]: New LCM state is BOOT_FAILURE
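Reading the log, /usr/bin/kvm on node4745 fails to start because librdmacm.so.1 cannot be loaded. A minimal sketch of how the missing libraries could be listed on the node, assuming ldd and awk are available there (the helper name missing_libs is hypothetical, not something NodeWeaver ships):

```shell
#!/bin/sh
# Hypothetical helper: read `ldd` output on stdin and print the names of
# the shared libraries that the dynamic linker could not resolve.
missing_libs() {
    awk '/=> not found/ { print $1 }'
}

# Example use on the affected node (binary path taken from the log):
#   ldd /usr/bin/kvm | missing_libs
```

If this prints librdmacm.so.1, the library is absent from the node's library path rather than the VM template being at fault.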
When I first started hitting the problem, the error was:
20:28:06 2020 [Z0][VMM][I]: Command execution fail: cat << EOT | /var/tmp/one/vmm/kvm/deploy '/var/lib/one//datastores/0/2/deployment.0' 'node4745' 2 node4745
20:28:06 2020 [Z0][VMM][I]: error: Failed to create domain from /var/lib/one//datastores/0/2/deployment.0
20:28:06 2020 [Z0][VMM][I]: error: invalid argument: could not find capabilities for arch=x86_64 domaintype=kvm
20:28:06 2020 [Z0][VMM][E]: Could not create domain from /var/lib/one//datastores/0/2/deployment.0
20:28:06 2020 [Z0][VMM][I]: ExitCode: 255
I addressed this first problem by changing the CPU model in virt-manager to host-passthrough, based on this article.
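As far as I understand it, the "could not find capabilities" error appears when the node VM cannot see the host's hardware virtualization extensions, which host-passthrough exposes. A rough way to confirm this from inside a node, assuming a Linux guest with /proc/cpuinfo (the helper name has_virt_flags is hypothetical):

```shell
#!/bin/sh
# Hypothetical helper: succeed if the given cpuinfo file (default
# /proc/cpuinfo) advertises Intel VT-x (vmx) or AMD-V (svm).
has_virt_flags() {
    grep -Eqw 'vmx|svm' "${1:-/proc/cpuinfo}"
}

# Example use inside a node VM:
#   has_virt_flags && echo "virtualization flags visible" || echo "not visible"
#   ls -l /dev/kvm    # the KVM device should also exist
```

On my setup this check is only a sanity test for nested virtualization; it says nothing about the later librdmacm.so.1 failure.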
Any help appreciated!