You can use one or more host paths (directories on the host node’s filesystem) to access data directly in your machine learning experiments and jobs.
Tip: Please visit the Kubernetes documentation to learn about host paths.
Use the host path as a connection
connections:
- name: node-dataset
  kind: host_path
  schema:
    mountPath: "/path/dataset"
    hostPath: "/path/to/dataset"
To mount the data with the read-only option:
connections:
- name: node-dataset
  kind: host_path
  schema:
    mountPath: "/path/dataset"
    hostPath: "/path/to/dataset"
    readOnly: true
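For orientation, a host path connection roughly corresponds to a Kubernetes hostPath volume combined with a volume mount. The plain Kubernetes Pod manifest below is a minimal sketch of that idea, not the exact spec Polyaxon generates; the pod name, container name, image, and command are placeholders chosen for illustration.

```yaml
# Sketch only: a plain Kubernetes Pod, shown to illustrate the hostPath concept.
apiVersion: v1
kind: Pod
metadata:
  name: dataset-reader            # placeholder name
spec:
  containers:
  - name: main                    # placeholder name
    image: busybox                # placeholder image
    command: ["ls", "/path/dataset"]
    volumeMounts:
    - name: node-dataset
      mountPath: /path/dataset    # plays the role of mountPath in the connection schema
      readOnly: true              # plays the role of readOnly in the connection schema
  volumes:
  - name: node-dataset
    hostPath:
      path: /path/to/dataset      # plays the role of hostPath in the connection schema
```

Because the data lives on a specific node's filesystem, a pod only sees it when scheduled on a node where that path exists.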
If you want to access multiple datasets:
connections:
- name: dataset1
  kind: host_path
  schema:
    mountPath: /plx-dataset1
    hostPath: "/path/to/dataset1"
    readOnly: true
- name: dataset2
  kind: host_path
  schema:
    mountPath: /plx-dataset2
    hostPath: "/path/to/dataset2"
    readOnly: true
Update/Install Polyaxon CE or Polyaxon Agent deployment
You can now deploy or upgrade your Polyaxon CE or Polyaxon Agent deployment with access to the data on the host path.
Access to the dataset in your experiments/jobs
To mount the connection in one of the containers of your jobs or services:
run:
  kind: job
  connections: [dataset1]
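Or, to request several connections at once (s3-dataset1 is assumed to be another connection defined in your deployment):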
run:
  kind: job
  connections: [dataset1, s3-dataset1]
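Once a connection is requested, the data should be available inside the container at the connection's mountPath. The following is a minimal sketch of a complete Polyaxonfile, assuming the dataset1 connection defined earlier with mountPath /plx-dataset1; the component name, image, and command are illustrative placeholders.

```yaml
version: 1.1
kind: component
name: list-dataset                # placeholder name
run:
  kind: job
  connections: [dataset1]
  container:
    image: busybox                # placeholder image
    # List the files exposed at the connection's mountPath
    command: ["ls", "-la", "/plx-dataset1"]
```

If the listing comes back empty, a likely cause is that the job ran on a node where the host path does not exist or is empty.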
Use the initializer to load the dataset
To use the artifacts initializer to load the dataset:
run:
  kind: job
  init:
  - artifacts: {dirs: [...], files: [...]}
    connection: "dataset1"
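As an illustration of how the dirs and files lists might be filled in, here is a hedged sketch. The directory and file names are hypothetical, and they are assumed to be paths relative to the root of the dataset1 connection; check the initializer reference for the exact semantics in your Polyaxon version.

```yaml
run:
  kind: job
  init:
  - artifacts:
      dirs: ["images/train"]      # hypothetical directory, assumed relative to the connection root
      files: ["labels.csv"]       # hypothetical file, assumed relative to the connection root
    connection: "dataset1"
```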