MEP-20 L3 only by majst01 · Pull Request #282 · metal-stack/website

majst01 · 2026-06-09T07:49:37Z

Description

DRAFT L3 only network

Please only review the Readme.md, the files in the ai folder where generated during the design process from AI and will be removed in the final MEP. I kept them only for reference during the review process.

Used AI-Tools ✨

Qwen3.6 used for generation of ideas in the ai folder.

netlify · 2026-06-09T07:49:42Z

✅ Deploy Preview for metal-stack-io ready!

Name	Link
🔨 Latest commit	`da77748`
🔍 Latest deploy log	https://app.netlify.com/projects/metal-stack-io/deploys/6a47741abed7980008ba056f
😎 Deploy Preview	https://deploy-preview-282--metal-stack-io.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

majst01 · 2026-06-17T14:00:35Z

+
+## metal-core
+
+metal-core will need to support additional configuration templates for the boot vrf. 


We could also install metal-core in a way that it also talks to metal-apiserver from within this boot-vrf which completely eliminates the need for weird routes on the switches

Sven-Ric · 2026-06-17T16:20:12Z

+The L3 only boot and registration process can be described as follows:
+
+- Every server will be scanned on a regular basis from the metal-bmc if there is IPXE is configured as boot iso payload. This is a additional task on the metal-bmc. metal-bmc already scans all servers on a regular basis to gather power metrics etc.
+- If the boot iso is set to ipxe, the boot source override must be set to CDROM instead of PXE from network and a reboot must be triggered (migration to this approach, not when a machine is allocated).


If we don't plan on removing support for the old PXE boot with this change, it could make sense to track the boot mode of each machine in metal-api. The migration step could then be triggered and tracked by metal-api.

…yer-3-only

vknabel · 2026-07-02T09:04:02Z

+
+The placement therefore follows from the role given to `metal-boot`. If it only handles the lightweight control functions such as token issuance, `boot.ipxe`, DNS and NTP, placing a container on each leaf is acceptable. The small control steps stay well within the `ip2me` budget, and this also fits the suggestion from the design notes that `metal-boot` could be deployed on each switch with a shared anycast address for redundancy. The downside is that it exposes additional services on critical infrastructure, so the container still needs proper hardening. If `metal-boot` must instead act as a complete proxy that also carries bulk traffic, it should be placed on a fabric reachable host such as a management server. From there the proxied traffic is forwarded in hardware and never punted to a switch CPU, so CoPP does not apply.
+
+The metal-image-cache-sync is currently placed on the management-servers. One of the stated goals is to remove the need for connections between the production infrastructure and the management infrastrutcure. Since placing or proxying the image cache on the switches is not viable, the image cache has to move to a different location. The image cache can either be hosted on a metal-stack provisioned machine, or on a server outside of metal-stack's scope.


When placing the image cache on a metal-stack provisioned machine, how would the bootstrap work here? Temporary server outside of metal-stack?

Image cache is totally optional, so for the first machine pulling the image directly would only slow down installation

muhittink · 2026-07-02T10:02:17Z

+- Enable automated IPv6 address acquisition via SLAAC (RFC 4862) driven by Router Advertisements (RFC 4861) instead of DHCP
+- IPv6 in a dedicated Boot VRF instead of a Boot VLAN.
+
+This approach requires that metal-apiserver, metal-hammer, ipxe and a new component running in the partition and connected to the boot-vrf (`metal-boot` for now) are IPv6 ready.


Can we get rid of iPXE and its complexity completely?

I dont think so. I am pretty sure we would end up with a much more complex solution without it.

…yer-3-only

metal-robot Bot added the area: documentation Affects the documentation area. label Jun 9, 2026

metal-robot Bot added this to Development Jun 9, 2026

mwindower reviewed Jun 9, 2026

View reviewed changes

Comment thread community/04-Proposals/MEP20/ai/mep-ra-slaac-boot.md Outdated

Gerrit91 reviewed Jun 10, 2026

View reviewed changes

majst01 force-pushed the layer-3-only branch from fad312f to 748d609 Compare June 11, 2026 05:52

Sven-Ric reviewed Jun 11, 2026

View reviewed changes

Comment thread community/04-Proposals/MEP20/README.md Outdated

majst01 force-pushed the layer-3-only branch from 748d609 to 437c2f8 Compare June 11, 2026 11:16

Initial

f512e95

majst01 force-pushed the layer-3-only branch from 437c2f8 to f512e95 Compare June 12, 2026 05:07

chbmuc reviewed Jun 15, 2026

View reviewed changes

Comment thread community/04-Proposals/MEP20/README.md Outdated

chbmuc reviewed Jun 15, 2026

View reviewed changes

Comment thread community/04-Proposals/MEP20/README.md Outdated

majst01 force-pushed the layer-3-only branch from 5e1eb6c to 56bd5e0 Compare June 15, 2026 11:29

rename boot-helper to metal-boot

58e12ef

majst01 force-pushed the layer-3-only branch from 56bd5e0 to 58e12ef Compare June 15, 2026 11:33

Added preliminary necessary changes

e315656

majst01 commented Jun 17, 2026

View reviewed changes

Sven-Ric reviewed Jun 17, 2026

View reviewed changes

RDNSS tested successfully

d505794

majst01 changed the title ~~L3 only~~ MEP-20 L3 only Jun 21, 2026

Sven-Ric and others added 6 commits June 21, 2026 17:48

Added section about metal-boot scoping and placement

47ebec7

Hint howto upload virtual media from metal-bmc

32825f5

Added list for services that require ipv6 support

8c8d473

Merge branch 'main' of https://github.com/metal-stack/website into la…

8de575b

…yer-3-only

Remove AI, add note about BMC passwords

50a0409

Typo

e567874

vknabel reviewed Jul 2, 2026

View reviewed changes

muhittink reviewed Jul 2, 2026

View reviewed changes

majst01 added 2 commits July 3, 2026 10:34

metal-apiserver

a0484ae

Merge branch 'main' of https://github.com/metal-stack/website into la…

da77748

…yer-3-only

majst01 marked this pull request as ready for review July 3, 2026 08:34

majst01 requested a review from a team as a code owner July 3, 2026 08:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

MEP-20 L3 only#282

MEP-20 L3 only#282
majst01 wants to merge 12 commits into
mainfrom
layer-3-only

majst01 commented Jun 9, 2026 •

edited

Loading

Uh oh!

netlify Bot commented Jun 9, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

majst01 Jun 17, 2026

Uh oh!

Sven-Ric Jun 17, 2026

Uh oh!

Uh oh!

vknabel Jul 2, 2026

Uh oh!

majst01 Jul 2, 2026

Uh oh!

muhittink Jul 2, 2026

Uh oh!

majst01 Jul 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants


		## metal-core

		metal-core will need to support additional configuration templates for the boot vrf.


		The placement therefore follows from the role given to `metal-boot`. If it only handles the lightweight control functions such as token issuance, `boot.ipxe`, DNS and NTP, placing a container on each leaf is acceptable. The small control steps stay well within the `ip2me` budget, and this also fits the suggestion from the design notes that `metal-boot` could be deployed on each switch with a shared anycast address for redundancy. The downside is that it exposes additional services on critical infrastructure, so the container still needs proper hardening. If `metal-boot` must instead act as a complete proxy that also carries bulk traffic, it should be placed on a fabric reachable host such as a management server. From there the proxied traffic is forwarded in hardware and never punted to a switch CPU, so CoPP does not apply.

		The metal-image-cache-sync is currently placed on the management-servers. One of the stated goals is to remove the need for connections between the production infrastructure and the management infrastrutcure. Since placing or proxying the image cache on the switches is not viable, the image cache has to move to a different location. The image cache can either be hosted on a metal-stack provisioned machine, or on a server outside of metal-stack's scope.

Uh oh!

Conversation

majst01 commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Used AI-Tools ✨

Uh oh!

netlify Bot commented Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for metal-stack-io ready!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

majst01 Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

Sven-Ric Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

vknabel Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

majst01 Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

muhittink Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

majst01 Jul 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

majst01 commented Jun 9, 2026 •

edited

Loading

netlify Bot commented Jun 9, 2026 •

edited

Loading