Skip to content

add InferenceSet controller for scaling inference workloads automatically #1659

@andyzhangx

Description

@andyzhangx

Is your feature request related to a problem? Please describe.
add InferenceSet for scaling inference workloads automatically, this feature would be alpha in v0.8.0 and user needs to --set featureGates.gatewayAPIInferenceExtension=true during KAITO install

Describe the solution you'd like

Describe alternatives you've considered

Additional context

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

Status

No status

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions