Optimal Space Filling Design

Optimal Space Filling Design (OSF)

The goal in Design of Experiments is to determine the smallest sufficient set of points required to calculate a response surface. Therefore, you choose the type depending on the parametric problem and targeted response surface. The number of points depends on the number of input parameters, or is user defined.

Optimal Space-Filling Design (OSF) creates optimal space filling Design of Experiments (DOE) plans according to some specified criteria. Essentially, OSF is a Latin Hypercube Sampling Design (LHS) that is extended with post-processing. It is initialized as an LHS and then optimized several times, remaining a valid LHS (without points sharing rows or columns) while achieving a more uniform space distribution of points (maximizing the distance between points).

To offset the noise associated with physical experimentation, classical DOE types such as CCD focus on parameter settings near the perimeter of the design region. Because computer simulation is not quite as subject to noise, though, the Optimal Space-Filling (OSF) design is able to distribute the design parameters equally throughout the design space with the objective of gaining the maximum insight into the design with the fewest number of points. This advantage makes it appropriate when a more complex meta-modeling technique such as Kriging, Non-Parametric Regression, or Neural Networks is used.

OSF shares some of the same disadvantages as LHS, though to a lesser degree. Possible disadvantages of an OSF design are:

When the CCD Samples sample type is selected, a maximum of 20 input parameters is supported.
Extremes, such as the corners of the design space, are not necessarily covered.
The selection of too few design points can result in a lower quality of response prediction.

The following properties are available for the OSF DOE type.

Design Type: The following choices are available:
- Max-Min Distance (default): Maximizes the minimum distance between any two points. This strategy ensures that no two points are too close to each other. For a small size of sampling (N), the Max-Min Distance design generally lies on the exterior of the design space and fill in the interior as N becomes larger. Generally, this is the faster algorithm.
- Centered L2: Minimizes the centered L2-discrepancy measure. The discrepancy measure corresponds to the difference between the empirical distribution of the sampling points and the uniform distribution. This means that the centered L2 yields a uniform sampling. This design type is computationally faster than the Maximum Entropy type.
- Maximum Entropy: Maximizes the determinant of the covariance matrix of the sampling points to minimize uncertainty in unobserved locations. This option often provides better results for highly correlated design spaces. However, its cost increases non-linearly with the number of input parameters and the number of samples to be generated. Thus, it is recommended only for small parametric problems.
Maximum Number of Cycles: Determines the number of optimization loops the algorithm needs, which in turns determines the discrepancy of the DOE. The optimization is essentially combinatorial, so a large number of cycles slows down the process. However, this makes the discrepancy of the DOE smaller. For practical purposes, 10 cycles is generally good for up to 20 variables. The value must be greater than 0. The default is 10.
- Samples Type: Determines the number of DOE points the algorithm should generate. This option is suggested if you have some advanced knowledge about the nature of the metamodel. The following choices are available:
  - CCD Samples (default): Supports a maximum of 20 inputs. Generates the same number of samples a CCD DOE would generate for the same number of inputs. You can use this to generate a space filling design that has the same cost as a corresponding CCD design.
  - Linear Model Samples: Generates the number of samples as needed for a linear metamodel.
  - Pure Quadratic Model Samples: Generates the number of samples as needed for a pure quadratic metamodel (no cross terms).
  - Full Quadratic Samples: Generates the number of samples needed to generate a full quadratic model.
  - User-Defined Samples: Specify the desired number of samples.
- Seed Value: Set the value used to initialize the random number generator invoked internally by the LHS algorithm. Although the generation of a starting point is random, the seed value consistently results in a specific LHS. This property allows you to generate different samplings by changing the value or regenerate the same sampling by keeping the same value. The default is 0.
- Number of Samples: Enabled when Samples Type is set to User-Defined Samples. Specifies the default number of samples. The default is 10.