Upgrading server cluster to v1.10.3 unable to discover fellow peers #26233

Open
@bfqrst

Nomad version

Nomad v1.10.3
BuildDate 2025-07-08T18:46:27Z
Revision fde0bf2

Operating system and Environment details

Ubuntu Noble
Linux auto-nomad-dwt 6.8.0-1030-aws #32-Ubuntu SMP Wed May 28 19:33:40 UTC 2025 aarch64 aarch64 aarch64 GNU/Linux

Issue

Upgrading the cluster on AWS via a rolling release (ASG) results in the new cluster peers being unable to discover each other.

Reproduction steps

Have an AWS ASG ready with 3 members. Build a new image via Packer with v1.10.3 in it. Cycle the ASG.
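
For context, here is a minimal sketch of the server join configuration implied by the `join_config` string in the agent log under Actual Result. The `retry_join` value is copied from that log line. Everything else, including `bootstrap_expect = 3` (assumed from the 3-member ASG), is an assumption, and the real agent configuration may differ.

```hcl
# Sketch only: reconstructed from the join_config string in the agent log.
server {
  enabled          = true
  bootstrap_expect = 3 # assumption: matches the 3-member ASG

  server_join {
    # Cloud auto-join string exactly as reported in the log line.
    retry_join = [
      "provider=aws region=eu-central-1 tag_key=nomad-servers tag_value=nomad-auto addr_type=private_v4"
    ]
  }
}
```

With this kind of configuration, the agent resolves peers by calling EC2 DescribeInstances and filtering on the given tag, which is the call that fails in the log.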

Expected Result

The running cluster is able to discover fellow cluster members.

Actual Result

Jul 09 10:20:34 auto-nomad-dwt nomad[597]: 2025-07-09T10:20:34.579+0200 [ERROR] agent.joiner: discovering join addresses failed: agent_mode=server join_config="provider=aws region=eu-central-1 tag_key=nomad-servers tag_value=nomad-auto addr_type=private_v4" error="discover-aws: DescribeInstancesInput failed: operation error EC2: DescribeInstances, exceeded maximum number of attempts, 3, https response error StatusCode: 0, RequestID: , request send failed, Post \"https://ec2.eu-central-1.api.aws/\": AuthorizedOnly"

Reverting the new member to v1.10.2 resolves the issue.

Job file (if appropriate)

Nomad Server logs (if appropriate)

Nomad Client logs (if appropriate)
