When you reduce the concurrency of a pipeline or a queue, Polyaxon does not kill or cancel active or scheduled runs to meet the new value. Instead, it applies a “draining” effect: the active or scheduled runs are allowed to finish first, and no more operations are scheduled under that pipeline or on that queue in the meantime.
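The draining behavior described above can be pictured with a small toy model. This is only a sketch and not Polyaxon's scheduler: the function `schedulable_slots` and the variables `in_flight` and `new_limit` are hypothetical names chosen for illustration, and the model assumes that scheduling resumes once the number of active or scheduled runs falls below the new limit.

```python
def schedulable_slots(limit: int, in_flight: int) -> int:
    """Toy model (hypothetical, not Polyaxon code): how many new
    operations could be scheduled given a concurrency limit and the
    number of runs that are currently active or scheduled."""
    # Never negative: exceeding the limit does not cancel anything,
    # it only means no new operation is scheduled.
    return max(0, limit - in_flight)


# Five runs are in flight when the concurrency is reduced to 2.
in_flight = 5
new_limit = 2

# Let the existing runs finish one by one and record, at each step,
# how many runs are in flight and how many new slots are open.
# For illustration only, no new run is actually started here.
history = []
while in_flight > 0:
    history.append((in_flight, schedulable_slots(new_limit, in_flight)))
    in_flight -= 1

for runs, slots in history:
    print(f"in flight: {runs}, open slots: {slots}")
```

In this model nothing is cancelled while five runs exceed the limit of 2; the open slot count simply stays at zero until enough runs have finished.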
If the concurrency is increased, Polyaxon will try to reach the new value by scheduling new operations while respecting other higher-level limits. For example, increasing the concurrency of a pipeline to a value higher than the concurrency of the queue it uses or the organization’s quota results in a pass-through: the agent manager will only consider the queue or the organization when managing concurrency. In other words, it has the same effect as not setting any concurrency limit on the pipeline.
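One way to reason about the pass-through is to treat the effective concurrency as the smallest of the limits that are set. The sketch below is an assumption-based illustration, not the agent manager's actual logic; `effective_concurrency` and its arguments are hypothetical names, and `None` stands for "no limit set".

```python
from typing import Optional


def effective_concurrency(*limits: Optional[int]) -> Optional[int]:
    """Toy model (hypothetical, not Polyaxon code): the tightest of the
    configured limits wins; None means that level sets no limit."""
    configured = [limit for limit in limits if limit is not None]
    return min(configured) if configured else None


queue_limit = 4
org_quota = 8

# Pipeline limit above the queue limit: the queue is the binding limit.
with_high_pipeline_limit = effective_concurrency(10, queue_limit, org_quota)

# No pipeline limit at all: the result is the same.
without_pipeline_limit = effective_concurrency(None, queue_limit, org_quota)

# Pipeline limit below the queue limit: the pipeline is the binding limit.
with_low_pipeline_limit = effective_concurrency(2, queue_limit, org_quota)

print(with_high_pipeline_limit, without_pipeline_limit, with_low_pipeline_limit)
```

Under these assumptions, a pipeline limit of 10 on a queue limited to 4 behaves exactly like a pipeline with no limit, which matches the pass-through described above.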