Autoscaling Clusters
A cluster consists of multiple machines that are networked together. Each machine is called a node. A cluster consists of a head node that handles the job scheduling, and worker nodes that do the work. The worker nodes work in parallel with each other, boosting processing speed to deliver high performance computing.
An autoscaling cluster is a cluster that dynamically scales the number of nodes up or down to meet the demand of the workload. When jobs are submitted to the cluster, nodes are spun up, and as jobs finish, the nodes are shut down.
A cluster can have a set number of static nodes that are allocated right away and remain available throughout the life of the cluster. Otherwise, nodes are allocated dynamically (as needed) to execute jobs submitted to the cluster. To control costs, you can put a limit on the number of dynamic nodes that can be spun up.
Only one autoscaling cluster can be present in a project space at one time. However, the cluster can have multiple job submission queues, and each queue can have a different application associated with it.
Topics:
- Creating an Autoscaling Cluster
- Checking the Installation Status of Autoscaling Cluster Applications
- Adding Applications to an Autoscaling Cluster
- Adding Ansys HPC Platform Services to an Autoscaling Cluster
- Adding Queues to an Autoscaling Cluster
- Deleting a Queue from an Autoscaling Cluster
- Mounting a Storage to an Autoscaling Cluster
- Unmounting a Storage from an Autoscaling Cluster
- Renaming an Autoscaling Cluster
- Viewing the Tags Applied to an Autoscaling Cluster
- Deleting an Autoscaling Cluster
For step-by-step examples of autoscaling cluster configuration and usage, see Recommended Configurations by Application in the Recommended Usage Guide.
For general information on using autoscaling clusters, see Autoscaling Clusters in the User's Guide.