This repository has been archived by the owner on Nov 9, 2020. It is now read-only.

Perform manual validation for Docker Swarm Mode interop identified tests #927

Closed
lipingxue opened this issue Feb 17, 2017 · 6 comments

lipingxue commented Feb 17, 2017

This is to track Docker Swarm Mode (introduced in the Docker 1.12 release) interop-related issues.

lipingxue self-assigned this Feb 17, 2017
tusharnt added this to the 0.12 milestone Feb 17, 2017
lipingxue modified the milestones: v1 GA, 0.12 Feb 17, 2017

govint commented Feb 20, 2017

The ESX service needs to be made cluster-aware. Currently there are multiple places where we have no way to tell whether a VM really exists, because the VM may not be on the same node. #934 and #929 show how a VM that is not necessarily deleted, but is running on a different node, cannot be identified as existing. We can have the UUID of a VM in a tenant's VM list and still not know whether the VM exists.

There has to be a way to determine the following:

  1. Include the datacenter in VC to which the ESX host belongs, and make it part of the input to the ESX service.
  2. Provide an API to contact VC and determine whether a VM (by UUID) exists in the datacenter to which the host belongs (a sketch follows this list).
  3. Use (2) in all cases where we need to determine whether a VM actually exists (the per-tenant VM list, handling stale attaches in vmdk_ops.py).
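
A minimal sketch of what item (2) could look like, assuming pyVmomi; the vCenter host, credentials, datacenter name, and UUID below are hypothetical placeholders, and whether to match the BIOS or the instance UUID depends on which UUID the service stores:

# Sketch: ask VC whether a VM with a given UUID exists in the
# datacenter that the ESX host belongs to (item 2 above).
import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVmomi import vim

def vm_exists_in_datacenter(si, datacenter_name, vm_uuid):
    # Resolve the datacenter by name (top-level datacenters only).
    content = si.RetrieveContent()
    datacenter = next((dc for dc in content.rootFolder.childEntity
                       if isinstance(dc, vim.Datacenter)
                       and dc.name == datacenter_name), None)
    if datacenter is None:
        raise ValueError("datacenter %s not found" % datacenter_name)
    # vmSearch=True limits the lookup to VMs; instanceUuid=False
    # matches the BIOS UUID rather than the vCenter instance UUID.
    vm = content.searchIndex.FindByUuid(datacenter=datacenter,
                                        uuid=vm_uuid,
                                        vmSearch=True,
                                        instanceUuid=False)
    return vm is not None

# Hypothetical usage; certificate verification is disabled for brevity.
si = SmartConnect(host="vc.example.com",
                  user="administrator@vsphere.local",
                  pwd="secret",
                  sslContext=ssl._create_unverified_context())
try:
    print(vm_exists_in_datacenter(
        si, "Datacenter1", "564d0000-0000-0000-0000-000000000000"))
finally:
    Disconnect(si)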

tusharnt modified the milestones: 0.12, v1 GA Feb 23, 2017
tusharnt commented

@lipingxue can you please add your analysis here? We can then close this investigation issue.

tusharnt modified the milestones: 0.13, 0.12 Feb 28, 2017
lipingxue commented

The following are two action items identified after trying out Docker Swarm Mode with the vSphere Docker Volume Service.

  • Add Swarm-related tests to CI/CD
    In CI/CD we can add a test for stateful-service fail-over in Swarm mode. This is the test we need (see the sketch after this list):
    1. Create a Swarm cluster with three VMs (one master, two workers).
    2. Deploy a stateful service (redis, for example) with a persistent volume on worker1, log in with redis-cli, and set some key/value pairs, which redis stores on the volume.
    3. Kill worker1, wait a while, and check that the service restarts successfully on worker2. Log in with redis-cli and read back the keys set in step 2; the values should match what was set there.

  • Understand what customers need when using Swarm with persistent storage (fail-over only, or also the ability to scale up)
    The current Swarm mode supports only the fail-over case well (--replicas 1).
    Swarm mode cannot support scaling stateful services (--replicas > 1) well due to its own limitation (no task-level volume support), so there is not much we can do from the vSphere volume service point of view.
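
A rough sketch of how the CI fail-over test above could be scripted, assuming the swarm from step 1 is already up, the script runs on the master with docker and redis-cli on PATH, and all names, ports, and wait times are hypothetical (a real CI job would poll instead of sleeping, and would power off the worker through the test infrastructure):

# Sketch of the proposed fail-over test (steps 2 and 3 above).
import subprocess
import time

def sh(cmd):
    # Run a shell command and return its decoded stdout.
    return subprocess.check_output(cmd, shell=True).decode()

# Step 2: create the volume, start a single-replica redis service on a
# worker, and write a key/value pair through the published port.
sh("docker volume create --driver=vsphere --name=redis-data -o size=1GB")
sh("docker service create --name redis-ha --replicas 1 "
   "--mount type=volume,source=redis-data,target=/data "
   "--constraint node.role==worker -p 6379:6379 "
   "redis redis-server --appendonly yes")
time.sleep(60)                      # crude wait for the task to start
sh("redis-cli set testkey testvalue")

# Step 3: kill the worker running the task (out of band, e.g. by
# powering off the VM), wait for swarm to reschedule the task, then
# verify the data survived on the other worker.
# power_off_worker_running("redis-ha")   # hypothetical helper
time.sleep(120)
assert "1/1" in sh("docker service ls --filter name=redis-ha")
assert sh("redis-cli get testkey").strip() == "testvalue"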

tusharnt assigned shuklanirdesh82 and unassigned lipingxue Mar 9, 2017
tusharnt changed the title from "Docker Swarm Mode interop" to "Identify test scenarios for Docker Swarm Mode interop" Mar 9, 2017
shuklanirdesh82 commented

#927 (comment)

@ashahi1 I'm assigning this to you to perform manual validation on both the Photon and Ubuntu distros.

shuklanirdesh82 changed the title from "Identify test scenarios for Docker Swarm Mode interop" to "Perform manual validation for Docker Swarm Mode interop identified tests" Mar 13, 2017

ashahi1 commented Mar 17, 2017

Testing the fail-over scenario for Docker Swarm.


  1. Created a swarm of three nodes: one master and two workers. All three VMs are on the same host and the shared-vmfs datastore.
root@sc-rdops-vm02-dhcp-52-237:~# docker node ls
ID                           HOSTNAME                   STATUS  AVAILABILITY  MANAGER STATUS
2vixtu2afs45tx73xgn3qu4z7 *  sc-rdops-vm02-dhcp-52-237  Ready   Active        Leader
3fgwmxj57tz6b6f3e4o6uw5q7    sc-rdops-vm02-dhcp-52-237  Ready   Active
6np11t99e82iqp3cxenj87mfu    sc-rdops-vm02-dhcp-52-237  Ready   Active
root@sc-rdops-vm02-dhcp-52-237:~#

  2. Created a volume, redis-data, from the master
root@sc-rdops-vm02-dhcp-52-237:~# docker volume create --driver=vsphere --name=redis-data -o size=1GB
redis-data
root@sc-rdops-vm02-dhcp-52-237:~# docker volume ls
DRIVER              VOLUME NAME
vsphere             Vol1_200mb@sharedVmfs-0
vsphere             redis-data@sharedVmfs-0

  3. Created the redis service from the master with the volume mounted
docker service create --mount type=volume,target=/data,source=redis-data -p 6379:6379 --constraint node.role==worker redis redis-server --appendonly yes


[root@sc-rdops-vm18-dhcp-54-69:/tmp] /usr/lib/vmware/vmdkops/bin/vmdkops_admin.py ls
Volume      Datastore     VM-Group  Capacity  Used  Filesystem  Policy  Disk Format  Attached-to      Access      Attach-as               Created By     Created Date
----------  ------------  --------  --------  ----  ----------  ------  -----------  ---------------  ----------  ----------------------  -------------  ------------------------
Vol1_200mb  sharedVmfs-0  _DEFAULT  100MB     15MB  ext4        N/A     thin         detached         read-write  independent_persistent  ubuntu-Master  Thu Mar 16 22:32:15 2017
redis-data  sharedVmfs-0  _DEFAULT  1GB       54MB  ext4        N/A     thin         ubuntu-worker-2  read-write  independent_persistent  ubuntu-Master  Thu Mar 16 23:28:09 2017

[root@sc-rdops-vm18-dhcp-54-69:/tmp]

  4. Verified the service started on worker-2
root@sc-rdops-vm02-dhcp-52-237:~# docker service ls
ID            NAME         REPLICAS  IMAGE  COMMAND
463qb2vp1m1k  gracious_keller  1/1       redis  redis-server --appendonly yes
root@sc-rdops-vm02-dhcp-52-237:~#

  5. Updated the docker service's restart policy to on-failure
docker service update --restart-condition on-failure gracious_keller
  6. Entered data into redis
root@sc-rdops-vm02-dhcp-52-237:~# redis-cli
127.0.0.1:6379> set hello world
OK
127.0.0.1:6379> get hello
"world"
127.0.0.1:6379> set India Delhi
OK
127.0.0.1:6379> get India
"Delhi"
127.0.0.1:6379>

  7. Shut down worker-2; the service restarted on worker-1
root@sc-rdops-vm02-dhcp-52-237:~# docker node ls
ID                           HOSTNAME                   STATUS  AVAILABILITY  MANAGER STATUS
2vixtu2afs45tx73xgn3qu4z7 *  sc-rdops-vm02-dhcp-52-237  Ready   Active        Leader
3fgwmxj57tz6b6f3e4o6uw5q7    sc-rdops-vm02-dhcp-52-237  Ready   Active
6np11t99e82iqp3cxenj87mfu    sc-rdops-vm02-dhcp-52-237  Down    Active


[root@sc-rdops-vm18-dhcp-54-69:/tmp] /usr/lib/vmware/vmdkops/bin/vmdkops_admin.py ls
Volume      Datastore     VM-Group  Capacity  Used  Filesystem  Policy  Disk Format  Attached-to      Access      Attach-as               Created By     Created Date
----------  ------------  --------  --------  ----  ----------  ------  -----------  ---------------  ----------  ----------------------  -------------  ------------------------
Vol1_200mb  sharedVmfs-0  _DEFAULT  100MB     15MB  ext4        N/A     thin         detached         read-write  independent_persistent  ubuntu-Master  Thu Mar 16 22:32:15 2017
redis-data  sharedVmfs-0  _DEFAULT  1GB       55MB  ext4        N/A     thin         ubuntu-worker-1  read-write  independent_persistent  ubuntu-Master  Thu Mar 16 23:28:09 2017

[root@sc-rdops-vm18-dhcp-54-69:/tmp]

  8. Verified the data from worker-1. (The set USA Washington D.C. line below fails because redis-cli treats the unquoted D.C. as an extra argument to SET, which redis rejects with a syntax error; removing the space succeeds.)
root@sc-rdops-vm02-dhcp-52-237:~# redis-cli
127.0.0.1:6379> get hello
"world"
127.0.0.1:6379> get India
"Delhi"
127.0.0.1:6379> set USA Washington D.C.
(error) ERR syntax error
127.0.0.1:6379> set USA WashingtonD.C.
OK
127.0.0.1:6379> get USA
"WashingtonD.C."
127.0.0.1:6379>



ashahi1 commented Mar 17, 2017

@lipingxue

Liping, I have tested the fail-over scenario for Docker Swarm. I wrote data while the service was running on one of the workers (worker-2), killed that worker, and was then able to read the data back from the other worker (worker-1).

Please take a look at the posted output and re-open the issue if we need to do any more testing.
