v0.1.9

  • Add config deployment app to help guide users through the configuration options when deploying Polyaxon
  • Add possibility to delete/stop jobs/experiments from dashboards
  • Update charts to support rolling updates
  • Optimize and fix some issues in some API endpoints
  • Optimize and reduce number of generated queries (More queries to optimize)
  • Update SDK to expose only one client
  • Add a delay before creating the experiment group’s experiment to avoid not finding running status
  • Extend config manager to validate different generic specs and raise errors on bad config deployments
  • Abstract serialization of bookmarks to be used for lists
  • Fix an issue with the metrics api’s pagination
  • Enhanced the filters ui
  • Consolidate installed libraries

v0.1.8

  • Enable users to pull image from private registries
  • Add first version of an abstraction to seamlessly use cloud storage or persistent volumes for data, outputs, and logs
  • Annotate jobs with node used for the scheduling
  • Remove some optional env vars and move them to default values
  • Use configMap and secret envFrom to populate environment
  • Reduce number of partials
  • Update default page size to 20
  • Update usage of secrets and configs
  • Update clean commands
  • Fix an issue with check command
  • Fix a bug related to checking running experiment groups
  • Add missing tests for azure id creation
  • Enhance build and test process
  • Upgrade Kubernetes client to 6.0.0
  • Upgrade web socket and request libs
  • Upgrade api and workers requirements
  • Update test requirements
  • Move label and pod/job states schemas to core

v0.1.7

  • Add initial draft of actions
  • Add base integrations mechanism
  • Add notification via emai, slack, pagerduty, mattermost, discord, hipchat, webhooks …
  • Enlarge docker’s shm memory
  • Fix Microsoft authentication
  • Fix issue with cleaning pre-delete hooks
  • Remove deprecated polyaxon-lib
  • Separate email and admin user in helm chart
  • Fix api service with NodePort
  • Fix issue related to delete project command outside of initialized polyaxon workspace in CLI
  • Redirect directly to dashboard from CLI

v0.1.6

  • Add possibility to deploy Polyaxon with tls.
  • Expose default toleration/affinity for builds, jobs, experiments.
  • Add affinity and toleration to spec environment.
  • Update spec to use node_selector instead of node_selectors.
  • Refactor worker/ps/default_worker/default_ps in environment section.
  • Update CLI init command to always create a new config for the project.
  • Add shortcuts -f for follow and -p for past for the logs commands.
  • Creating job/build/experiment/group with cli will cache the instance automatically for the following commands.
  • Fix some command suggestions.
  • Upgrade pyyaml, there was a security issue.
  • Improve docker build process.
  • Fix pytorch distributed example.
  • Update chart cleaning hook delete strategy.
  • Fix an issue with detecting the correct k8s minor version.
  • Fix issue with docker build process: image name should be lower case.
  • Fix Run meta info for builds, jobs, and groups on dashboard.
  • Add draft for stats and actions.

v0.1.5

  • Build images: You can now pip install any file in your code folder without the need to call the file polyaxon_requirements.txt. Same thing if you need to build an image and have a shell script you just need to chmod +x the file and add it to the build setps.
  • Add bookmarking: You can bookmark your projects/groups/experiments/jobs/builds with Dashboard/API/CLI/Client. If your team is running too many hyper params tuning in many projects, finding the important experiments/jobs/builds can be exhausting since it requires searching each time. Bookmarks allow you to easily save important runs and have access to them in an easy way.
  • Add activity logs: shows recent create/update/delete activities in your team cluster or projects.
  • Add history/recently viewed: This is also something that some teams showed a lot of interests in having, using the platform for a couple of weeks/months and creating thousands of experiments per day, makes it very hard to access the right information.
  • Add support for external PostgreSQL: it’s possible to link an external PostgreSQL database instead of the in-cluster one.
  • SSO: Add support for Microsoft Azure authentication. in addition to github, gitlab, bittbucket, and LDAP, now you can allow your team to signup/login to Polyaxon with Microsoft Azure authentication
  • Tooltips: for dates (creation date, update date, starting date, termination date) the dashboard shows a humanized version (e.g. X hours/days ago) this was not super helpful, and many users asked to have the possibility to see the exact values. On your dashboard you can see the dates values by hovering on any date and it will show datetime in the “YYYY-MM-DD HH: mm:ss” format.
  • Add docs for replication: the docs now has a section outlining different strategies for replicating Polyaxon.
  • Add docs for postgres HA strategies.
  • Add docs for Microsoft Azure authentication.

v0.1.4

  • Fix some issues with node discovery
  • Optimize access to k8s api
  • Update image build process
  • Optimize resources monitoring on k8s
  • Add bookmarks for project, groups, experiments, jobs, builds
  • Fix some UI issues
  • Update docs
  • Other changes from 🇷🇺

v0.1.3

  • Add support for outputs reference from other jobs and experiments by id and name
  • Update metrics tab with empty metrics
  • Add some error handling for spec validation
  • Fix the instruction pages

v0.1.2

  • Update instructions tabs
  • Expose commit on the query spec
  • Add query spec docs
  • Add statuses tab
  • Add metrics tab
  • Add jobs cleaning
  • Update logic for allowing commits after delete notebooks
  • Update outputs logics for tensorboards

v0.1.1

  • Update dockerizer process: moved from a worker based process that handles image builds to pod being scheduled to build the image if does not exists
  • Possibility to force rebuilding an image
  • Separate run into 2 sections: run and build
  • Add build specification
  • Add tensorboard and notebook specifications
  • Add generic job specification
  • Optimize services images and reduce their sizes
  • Upgrade docker client
  • upgrade Kubernetes client
  • Add and test replication for api, workers, and monitors
  • Add tagging for projects, experiments, experiments groups, and jobs
  • Add possibility to name runs (experiments, groups, jobs, builds)
  • Add filtering, you can now filter based on our query spec: started_at:2015-01-01..2018-01-20, tags:foo|bar, metrics__loss:<=0.1
  • Add sorting, you can now sort by different attributes: -created_at, updated_at…
  • Remove upper limit on hptuning, this will allow hptuning algorithm worker to consume more cpu and converge faster.
  • Add download code: you can see now the commit made internally each time you upload the code
  • Add outputs download for experiments and jobs
  • Update tensorboards to support not only projects but also experiments and experiment group
  • Update and enhance ui
  • Add possibility to mount multiple data and outputs volumes
  • Extend the specification to allow to choose an output and data volumes to mount for an experiment or a job
  • Fix several usability and stability issues
  • Extend experiments api to allow to pull metrics and declarations

v0.1.0

  • Add LDAP auth backend
  • Add identity abstraction for oauth and integrations
  • Add Github oauth
  • Add Bitbucket oauth
  • Add Gitlab oauth
  • Update landing page and dashboard design
  • Update signup/login pages
  • Update behavior of experiment groups finished status
  • Expose more params for Bayesian optimization (algorithm was not converging sometime because of the default n_iter and n_warmup value)
  • Add more experiment group meta data: run dates and total time, iterations, status, and algorithm search
  • Fix issues with parsing numpy types in specification files