Skip to content
This repository was archived by the owner on Jan 30, 2020. It is now read-only.
This repository was archived by the owner on Jan 30, 2020. It is now read-only.

fleet start fails arbitrarily on cluster with more than 500 nodes #1579

Description

@Vishant0031

Test environment:
coreos version 835.11
fleetd version 0.11.5
etcd Version: 2.2.0

Fleet nodes:
700

Issue:
I am trying to start a fleet service on my cluster. I use command fleetctl start (diskinstall.service).
fleetctl list-machines shows 700 nodes.
and fleet service is a Global=true service and suppose to be launched on all the nodes.
When I start the service (fleetctl start diskinstall.service), service is started only on <700 nodes.
Sometimes the service is launched on ~400 nodes or ~500 and its very random.

At all the time fleetctl list-machines outputs 700 nodes.
etcd2 is also healthy all the time.
I have to repeat the fleetctl start command to overcome this issue.

Thanks.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions