File initializer

Users often need to run an operation that only triggers some bash logic or a tiny Python script, or to create a file from the inputs of a polyaxonfile and feed it to their main logic. Polyaxon now has a new initializer: file. This initializer creates a file, a here script (or here-doc), that subsequent init containers or the main container can use.

For more details please check this blog post.

Improved manual management of runs

Most of the experiment tracking that users perform is in-cluster and only concerns the run being executed by the pod running on Kubernetes. However, several users also launch local experiments within notebook sessions, either in-cluster or outside the cluster, and they generally want to track those experiments as well.

For more details please check this blog post.

Offline mode for ML tracking

Often users will need to test or debug their scripts before running them in-cluster. Polyaxon detects an environment variable POLYAXON_NO_OP that allows users to disable all client and tracking calls.
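
As a rough illustration of the pattern, the sketch below shows how a script could honor such a switch. Only the variable name POLYAXON_NO_OP comes from this post; the helpers `tracking_enabled` and `log_metric`, and the set of values treated as "on", are hypothetical and are not Polyaxon's actual client code.

```python
import os

# Assumption: these strings count as "on"; the values Polyaxon accepts may differ.
TRUTHY = {"1", "true", "yes"}


def tracking_enabled(environ=None):
    """Return False when POLYAXON_NO_OP is set to a truthy value."""
    environ = os.environ if environ is None else environ
    return environ.get("POLYAXON_NO_OP", "").strip().lower() not in TRUTHY


def log_metric(name, value, sink, environ=None):
    """Hypothetical stand-in for a tracking call: record unless no-op mode is on."""
    if not tracking_enabled(environ):
        return False  # skipped: nothing is sent or stored
    sink.append((name, value))
    return True


if __name__ == "__main__":
    sink = []
    log_metric("loss", 0.25, sink, environ={})  # recorded
    log_metric("loss", 0.20, sink, environ={"POLYAXON_NO_OP": "true"})  # skipped
    print(sink)
```

The point of the design is that the training script itself stays unchanged: the same calls run in both modes, and the environment decides whether they have any effect.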

For more details please check this blog post.

Timeline view

If you are running DAGs, hyperparameter tuning jobs, or schedules, you can take advantage of the new view that shows an aggregated table of all dependent runs of that pipeline.

For more details please check this blog post.

Improved client with more artifacts interfaces

This version adds some high-level artifact methods that were missing in previous versions:

  • delete_artifact: To delete a single artifact.
  • delete_artifacts: To delete a directory.
  • upload_artifacts_dir: To upload a directory.

Logs storage improvement

Logs are now stored in a similar way to other metric events, i.e. as pipe-separated values. This should reduce file sizes significantly, and the files are also more readable when used outside of the Polyaxon UI or CLI.
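
Pipe-separated files can be read and written with standard tooling. The following is a minimal sketch using Python's csv module; the column names in `FIELDS` and the helpers `dump_logs` and `load_logs` are hypothetical, since this post does not specify the fields Polyaxon actually stores.

```python
import csv
import io

# Hypothetical column layout; the real log schema is not given in this post.
FIELDS = ["timestamp", "container", "value"]


def dump_logs(rows):
    """Serialize log rows (dicts) as pipe-separated text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=FIELDS, delimiter="|", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


def load_logs(text):
    """Parse pipe-separated text back into a list of dicts."""
    return [dict(row) for row in csv.DictReader(io.StringIO(text), delimiter="|")]


if __name__ == "__main__":
    # Sample data made up for the example.
    rows = [{"timestamp": "2021-01-01T00:00:00", "container": "main", "value": "step 1"}]
    text = dump_logs(rows)
    print(text, end="")
    print(load_logs(text) == rows)
```

Compared with a format that repeats the key names on every record, a header line followed by delimited rows only spells out the field names once, which is one plausible reason for the smaller files.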

New versions for TFJob and MPIJob

All Kubeflow charts were updated to point to their latest versions. The UI is also now smarter at showing the logs of the main containers, even when the containers follow naming conventions different from the one used by Polyaxon.

Scalable operator reconciles

If you are running a high number of concurrent operations, you might notice slower job termination or status reporting with the default deployment options.

The operator now has some additional logic to detect issues and report correct states back to the API. The deployment configs (for Polyaxon CE and Polyaxon Agent) also have a new field, maxConcurrentReconciles, that you can use to perform more concurrent reconciliations:

operator:
  maxConcurrentReconciles: 10
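
To make the effect of such a limit concrete, here is a small conceptual sketch of bounded concurrency using a thread pool. It is not how the operator is implemented; the function `reconcile_all` and the simulated work are hypothetical and only illustrate that at most `max_concurrent` items are processed at the same time.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


def reconcile_all(items, max_concurrent):
    """Process items with a concurrency cap; return the results and the observed peak."""
    lock = threading.Lock()
    active = 0
    peak = 0

    def reconcile(item):
        nonlocal active, peak
        with lock:
            active += 1
            peak = max(peak, active)
        time.sleep(0.01)  # stand-in for real reconciliation work
        with lock:
            active -= 1
        return item

    # The pool size is the cap: no more than max_concurrent calls run at once.
    with ThreadPoolExecutor(max_workers=max_concurrent) as pool:
        results = list(pool.map(reconcile, items))
    return results, peak


if __name__ == "__main__":
    results, peak = reconcile_all(range(20), max_concurrent=4)
    print(len(results), "items processed; peak concurrency:", peak)
```

Raising the cap lets a backlog of pending items drain faster, at the cost of more simultaneous work for the process and the services it talks to.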

The operator was also updated to pull lower-level conditions and report them when an operation has issues:

[Image: status-warning]

Comparison table opened indicator

The comparison table has a new indicator when a run is opened in flyout mode:

[Image: flyout-indicator.png]

Learn More about Polyaxon

This blog post only goes over a couple of the features that we have shipped since our last product update; there are several other features and fixes worth checking. To learn more about all the features, fixes, and enhancements, please visit the release notes.

Polyaxon continues to grow quickly and keeps improving and providing the simplest machine learning abstraction. We hope that these updates will improve your workflows and increase your productivity, and again, thank you for your continued feedback and support.