Package 'cardinalR'

Title: Collection of Data Structures
Description: A collection of simple simulation datasets designed for generating Nonlinear Dimension Reduction representations techniques such as t-distributed Stochastic Neighbor Embedding, and Uniform Manifold Approximation and Projection. These datasets serve as a valuable resource for understanding the reliability of Nonlinear Dimension Reduction representations in various contexts.
Authors: Jayani P.G. Lakshika [aut, cre] , Dianne Cook [aut] , Paul Harrison [aut] , Michael Lydeamore [aut] , Thiyanga S. Talagala [aut]
Maintainer: Jayani P.G. Lakshika <[email protected]>
License: MIT + file LICENSE
Version: 0.1.1
Built: 2025-02-12 05:59:17 UTC
Source: https://github.com/jayanilakshika/cardinalr

Help Index


Generate Cell Cycle Data with Noise

Description

This function generates a cell cycle dataset with added noise dimensions.

Usage

cell_cycle(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the cell cycle data with added noise.

Examples

set.seed(20240412)
cell_cycle_data <- cell_cycle(
  n = 300, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Clusters with Different Shapes

Description

This function generates clusters with different shapes, including both Gaussian and non-Gaussian clusters.

Usage

clust_diff_shapes(
  n,
  num_gau_clust,
  num_non_gau_clust,
  clust_sd_gau,
  clust_sd_non_gau,
  num_dims,
  a,
  b
)

Arguments

n

The total number of data points to be generated.

num_gau_clust

The number of Gaussian clusters to generate.

num_non_gau_clust

The number of non-Gaussian clusters to generate.

clust_sd_gau

The standard deviation for the Gaussian clusters.

clust_sd_non_gau

The standard deviation for the non-Gaussian clusters.

num_dims

The number of dimensions for the data points.

a

The scaling factor for the non-Gaussian cluster shape.

b

The translation factor for the non-Gaussian cluster shape.

Value

A matrix containing the generated clusters with different shapes.

Examples

# Generate clusters with default parameters
set.seed(20240412)
data <- clust_diff_shapes(
  n = 300, num_gau_clust = 4,
  num_non_gau_clust = 2, clust_sd_gau = 0.05, clust_sd_non_gau = 0.1,
  num_dims = 7, a = 2, b = 4
)

Generate Clusters with Different Shapes and Different Number of Points

Description

This function generates clusters with different shapes, including both Gaussian and non-Gaussian clusters, with different numbers of points in each cluster.

Usage

clust_diff_shapes_pts(
  clust_size_vec,
  num_gau_clust,
  num_non_gau_clust,
  clust_sd_gau,
  clust_sd_non_gau,
  num_dims,
  a,
  b
)

Arguments

clust_size_vec

A vector specifying the number of points for each cluster.

num_gau_clust

The number of Gaussian clusters to generate.

num_non_gau_clust

The number of non-Gaussian clusters to generate.

clust_sd_gau

The standard deviation for the Gaussian clusters.

clust_sd_non_gau

The standard deviation for the non-Gaussian clusters.

num_dims

The number of dimensions for the data points.

a

The scaling factor for the non-Gaussian cluster shape.

b

The translation factor for the non-Gaussian cluster shape.

Value

A matrix containing the generated clusters with different shapes and different numbers of points.

Examples

# Generate clusters with default parameters
set.seed(20240412)
data <- clust_diff_shapes_pts(
  clust_size_vec = c(50, 50, 50, 50, 100, 100),
  num_gau_clust = 4,
  num_non_gau_clust = 2, clust_sd_gau = 0.05, clust_sd_non_gau = 0.1,
  num_dims = 7, a = 2, b = 4
)

Generate data points along a conic spiral curve with optional noise.

Description

This function generates data points along a conic spiral curve with optional noise.

Usage

conic_spiral_3d(n, num_noise, min_n, max_n)

Arguments

n

Total number of data points to generate.

num_noise

Number of additional noise dimensions to add to the data.

min_n

Minimum value for the noise added to the data.

max_n

Maximum value for the noise added to the data.

Value

A matrix containing the generated data points with or without added noise.

Examples

set.seed(20240412)
conic_spiral_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate points on a conic spiral in 3D space.

Description

This function generates points on a conic spiral in 3D space.

Usage

conic_spiral_3d_row(a, b, c, w)

Arguments

a

Final radius of the cone.

b

Height of the object.

c

Inner radius.

w

Number of spirals.

Value

A matrix containing the generated points on the conic spiral.

Examples

set.seed(20240412)
conic_spiral_3d_row(1, 2, 0.5, 3)

Generate a 3D cube with optional noise.

Description

This function generates a 3D cube along with optional noise.

Usage

cube_3d(num_dims, num_noise, min_n, max_n)

Arguments

num_dims

Number of effective dimensions (default is 3 for a 3D cube).

num_noise

Number of additional noise dimensions to add to the data.

min_n

Minimum value for the noise added to the data.

max_n

Maximum value for the noise added to the data.

Value

A list containing the generated data matrix and the sample size.

Examples

set.seed(20240412)
cube_3d(num_dims = 3, num_noise = 2, min_n = -0.01, max_n = 0.01)

Generate points on a curvilinear 2D manifold

Description

This function generates points on a curvilinear 2D manifold based on a nonlinear equation.

Usage

curv_2d(n, num_noise, min_n, max_n)

Arguments

n

The number of points to generate.

num_noise

The number of noise dimensions to add to the generated points.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the generated points on the curvilinear 2D manifold.

Examples

set.seed(20240412)
curvilinear_points <- curv_2d(
  n = 100, num_noise = 2, min_n = -0.01,
  max_n = 0.01
)

Generate Curvy Branching Clusters with Noise

Description

This function generates data with curvy branching clusters along with added noise.

Usage

curvy_branch(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate curvy branching clusters with noise with custom parameters
set.seed(20240412)
data <- curvy_branch(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Curvy Branching Cluster Data

Description

This function generates curvy branching cluster data with three clusters of different shapes.

Usage

curvy_branch_clust(n, clust_vec, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

clust_vec

A vector specifying the number of points for each cluster. If not provided, the n is divided equally among the clusters.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate curvy branching cluster data with custom parameters
set.seed(20240412)
data <- curvy_branch_clust(
  n = 300, clust_vec = c(100, 150, 50),
  num_noise = 2, min_n = -0.05, max_n = 0.05
)

Generate Curvy Branching Cluster Data with Background Noise

Description

This function generates data with four clusters, two of which follow a curvilinear pattern and the other two are distributed randomly.

Usage

curvy_branch_clust_bkg(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate curvy branching cluster data with background noise with custom parameters
set.seed(20240412)
data <- curvy_branch_clust_bkg(
  n = 400, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Curvy Cell Cycle Data with Noise

Description

This function generates a curvy cell cycle dataset with added noise dimensions.

Usage

curvy_cycle(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the curvy cell cycle data with added noise.

Examples

set.seed(20240412)
curvy_cell_cycle_data <- curvy_cycle(
  n = 300, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Curvy Tree Data with Noise

Description

This function generates a dataset representing a curvy tree structure, with added noise.

Usage

curvy_tree(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the curvy tree data with added noise.

Examples

set.seed(20240412)
tree_data <- curvy_tree(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate data representing small spheres within a larger encompassing sphere with added noise.

Description

This function generates data points representing small spheres within a larger encompassing sphere and adds noise to the data if specified.

Usage

diff_sphere(n, num_noise, min_n, max_n)

Arguments

n

Total number of data points to generate, should be a multiple of 13.

num_noise

Number of additional noise dimensions to add to the data.

min_n

Minimum value for the noise added to the data.

max_n

Maximum value for the noise added to the data.

Value

A matrix containing the generated data points with or without added noise.

Examples

set.seed(20240412)
diff_sphere(
  n = 390, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate points sampled from the Dini surface with optional noise.

Description

This function generates points sampled from the Dini surface along with optional noise.

Usage

dini_surface_3d(n, num_noise, min_n, max_n)

Arguments

n

Total number of data points to generate.

num_noise

Number of additional noise dimensions to add to the data.

min_n

Minimum value for the noise added to the data.

max_n

Maximum value for the noise added to the data.

Value

A matrix containing the generated data points with or without added noise.

Examples

set.seed(20240412)
dini_surface_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate points on a Dini's surface.

Description

This function generates points on a Dini's surface.

Usage

dini_surface_3d_row(a = 1, b = 1)

Arguments

a

Outer radius of the surface.

b

Space between loops.

Value

A matrix containing the generated points on the surface.

Examples

set.seed(20240412)
dini_surface_3d_row(a = 1, b = 1)

Generate Eight Branching Data with Noise

Description

This function generates a dataset representing eight branching patterns, with added noise.

Usage

eight_branch(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the eight branching data with added noise.

Examples

set.seed(20240412)
branching_data <- eight_branch(
  n = 400, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Four-Branching Data with Noise

Description

This function generates a dataset representing four branches with added noise.

Usage

four_branch(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the four-branching data with added noise.

Examples

set.seed(20240412)
four_branching_data <- four_branch(
  n = 400, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Four Different Long Clusters with Noise

Description

This function generates a dataset consisting of four different long clusters with added noise.

Usage

four_long_clust(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the four different long clusters with added noise.

Examples

set.seed(20240412)
four_diff_long_clusters <- four_long_clust(
  n = 200, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate Four Long Clusters with Background Noise

Description

This function generates data with four long clusters along with background noise.

Usage

four_long_clust_bkg(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate four long clusters with background noise with custom parameters
set.seed(20240412)
data <- four_long_clust_bkg(
  n = 400, num_noise = 4, min_n = -0.05,
  max_n = 0.05
)

Generate synthetic data with Gaussian clusters

Description

Generate Gaussian Clusters

Usage

gau_clust(
  n,
  num_clust,
  mean_matrix,
  var_vec,
  num_dims,
  num_noise,
  min_n,
  max_n
)

Arguments

n

The total number of data points to be generated.

num_clust

The number of clusters to generate.

mean_matrix

A matrix where each row represents the mean vector for a cluster.

var_vec

A vector specifying the variance for each cluster.

num_dims

The number of effective dimensions for the data points.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Details

This function generates Gaussian clusters with specified parameters.

Value

A matrix containing the generated Gaussian clusters.

Examples

set.seed(20240412)
gau_clust(
  n = 300, num_clust = 5,
  mean_matrix = rbind(
    c(1, 0, 0, 0), c(0, 1, 0, 0), c(0, 0, 1, 0),
    c(0, 0, 0, 1), c(0, 0, 0, 0)
  ), var_vec = c(0.05, 0.05, 0.05, 0.05, 0.05),
  num_dims = 4, num_noise = 2, min_n = -0.05, max_n = 0.05
)

Generate Gaussian Clusters with Different Points

Description

This function generates Gaussian clusters with different numbers of points per cluster.

Usage

gau_clust_diff(
  clust_size_vec,
  num_clust,
  mean_matrix,
  var_vec,
  num_dims,
  num_noise,
  min_n,
  max_n
)

Arguments

clust_size_vec

A vector specifying the number of points in each cluster.

num_clust

The number of clusters to generate.

mean_matrix

A matrix where each row represents the mean vector for a cluster.

var_vec

A vector specifying the variance for each cluster.

num_dims

The number of effective dimensions for the data points.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated Gaussian clusters with different points.

Examples

# Generate Gaussian clusters with custom parameters
set.seed(20240412)
data <- gau_clust_diff(
  clust_size_vec = c(50, 100, 200, 50),
  num_clust = 4, mean_matrix =
    rbind(
      c(1, 0, 0, 0, 0, 0), c(0, 1, 0, 0, 0, 0),
      c(0, 0, 1, 0, 0, 0), c(0, 0, 0, 1, 0, 0)
    ),
  var_vec = c(0.02, 0.05, 0.06, 0.1),
  num_dims = 6, num_noise = 4,
  min_n = -0.05, max_n = 0.05
)

Generate Cluster and Curvilinear Data with Noise

Description

This function generates data with two clusters, one following a curvilinear pattern and the other distributed randomly.

Usage

gau_curvy_clust(n, clust_size_vec, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

clust_size_vec

A vector specifying the number of points for each cluster. If not provided, the n is divided equally between the two clusters.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate cluster and curvilinear data with custom parameters
set.seed(20240412)
data <- gau_curvy_clust(
  n = 300, clust_size_vec = c(100, 200), num_noise = 3,
  min_n = -0.05, max_n = 0.05
)

Generate Clusters and Curvilinear Data with Noise

Description

This function generates data with clusters and curvilinear patterns along with added background noise.

Usage

gau_curvy_clust_bkg(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate clusters and curvilinear data with noise with custom parameters
set.seed(20240412)
data <- gau_curvy_clust_bkg(
  n = 260, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Background Noise Data

Description

This function generates background noise data with specified parameters such as the number of samples, number of dimensions, mean, and standard deviation.

Usage

gen_bkg_noise(n, num_dims, mean, sd)

Arguments

n

Number of samples to generate.

num_dims

Number of dimensions (columns) of the data.

mean

Mean of the normal distribution used to generate noise (default is 0).

sd

Standard deviation of the normal distribution used to generate noise (default is 1).

Value

A matrix containing the generated background noise data, with n rows and num_dims columns.

Examples

# Generate background noise with custom mean and standard deviation
set.seed(20240412)
gen_bkg_noise(n = 50, num_dims = 3, mean = 5, sd = 2)

Generate Random Noise Dimensions

Description

This function generates random noise dimensions to be added to the coordinates of a sphere.

Usage

gen_noise_dims(n, num_noise, min_n, max_n)

Arguments

n

The number of observations for which to generate noise dimensions.

num_noise

The number of noise dimensions to generate.

min_n

The minimum value for the random noise.

max_n

The maximum value for the random noise.

Value

A matrix containing the generated random noise dimensions.

Examples

# Generate random noise dimensions with 3 dimensions, minimum value -1, and maximum value 1
set.seed(20240412)
gen_noise_dims(n = 50, num_noise = 3, min_n = -0.01, max_n = 0.01)

Generate Mirror S-curve Datasets with Noise

Description

This function generates mirror S-curve datasets with added noise dimensions.

Usage

mirror_scurves(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate (should be divisible by 2).

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the combined mirror S-curve datasets with added noise.

Examples

set.seed(20240412)
mirror_s_curve_data <- mirror_scurves(
  n = 200, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate a 5-D Mobius Strip

Description

This function generates a dataset representing a 5-dimensional Mobius strip.

Usage

mobius_5d(n, num_noise, min_n, max_n)

Arguments

n

The number of points to generate for the Mobius strip.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the generated Mobius strip.

Examples

set.seed(20240412)
mobius_data <- mobius_5d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate a Single Row for a 5-D Mobius Strip

Description

This function generates a single row of data representing a point on a 5-dimensional Mobius strip.

Usage

mobius_5d_row()

Value

A vector containing the coordinates of the point on the Mobius strip.

Examples

set.seed(20240412)
mobius_row <- mobius_5d_row()

Generate Mobius Cluster with Noise

Description

This function generates a dataset consisting of a mobius cluster with added noise.

Usage

mobius_clust(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the mobius cluster with added noise.

Examples

mobius_cluster <- mobius_clust(
  n = 200, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Mobius clust dataset with noise dimensions

Description

The 'mobius_clust_data' dataset contains a 3-dimensional Mobius and Gaussian cluster with added noise dimensions. Each data point is represented by five dimensions (x1 to x5).

Usage

data(mobius_clust_data)

Format

A data frame with 500 rows and 5 columns:

x1, x2, x3, x4, x5

High-dimensional coordinates

Source

This dataset is generated for illustrative purposes.

Examples

# Load the mobius_clust_data dataset
data(mobius_clust_data)

# Display the first few rows of the dataset
head(mobius_clust_data)

tSNE embedding for mobius_clust_data dataset which with noise dimensions tSNE parameters set to perplexity: 15.

Description

The 'mobius_clust_tsne_param1' dataset contains the tSNE (t-distributed Stochastic Neighbor Embedding) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two tSNE coordinates (emb1 and emb2).

Usage

data(mobius_clust_tsne_param1)

Format

## 'mobius_clust_tsne_param1' A data frame with 500 rows and 2 columns:

emb1

Numeric, first tSNE 2D embeddings.

emb2

Numeric, second tSNE 2D embeddings.

Source

This dataset is generated for illustrative purposes.

Examples

# Load the mobius_clust_tsne_param1 dataset
data(mobius_clust_tsne_param1)

# Display the first few rows of the dataset
head(mobius_clust_tsne_param1)

tSNE embedding for mobius_clust_data dataset which with noise dimensions tSNE parameters set to perplexity: 30.

Description

The 'mobius_clust_tsne_param2' dataset contains the tSNE (t-distributed Stochastic Neighbor Embedding) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two tSNE coordinates (emb1 and emb2).

Usage

data(mobius_clust_tsne_param2)

Format

## 'mobius_clust_tsne_param2' A data frame with 500 rows and 2 columns:

emb1

Numeric, first tSNE 2D embeddings.

emb2

Numeric, second tSNE 2D embeddings.

Source

This dataset is generated for illustrative purposes.

Examples

# Load the mobius_clust_tsne_param2 dataset
data(mobius_clust_tsne_param2)

# Display the first few rows of the dataset
head(mobius_clust_tsne_param2)

tSNE embedding for mobius_clust_data dataset which with noise dimensions tSNE parameters set to perplexity: 5.

Description

The 'mobius_clust_tsne_param3' dataset contains the tSNE (t-distributed Stochastic Neighbor Embedding) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two tSNE coordinates (emb1 and emb2).

Usage

data(mobius_clust_tsne_param3)

Format

## 'mobius_clust_tsne_param3' A data frame with 500 rows and 2 columns:

emb1

Numeric, first tSNE 2D embeddings.

emb2

Numeric, second tSNE 2D embeddings.

Source

This dataset is generated for illustrative purposes.

Examples

# Load the mobius_clust_tsne_param1 dataset
data(mobius_clust_tsne_param3)

# Display the first few rows of the dataset
head(mobius_clust_tsne_param3)

UMAP embedding for mobius_clust_data dataset which with noise dimensions UMAP parameters set to n-neigbors: 15 and min-dist: 0.1.

Description

The 'mobius_clust_umap_param1' dataset contains the UMAP (Uniform Manifold Approximation and Projection) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two UMAP coordinates (emb1 and emb2).

Usage

data(mobius_clust_umap_param1)

Format

## 'mobius_clust_umap_param1' A data frame with 500 rows and 2 columns:

emb1

Numeric, first UMAP 2D embeddings.

emb2

Numeric, second UMAP 2D embeddings.

Source

This dataset is generated for illustrative purposes.

Examples

# Load the mobius_clust_umap_param1 dataset
data(mobius_clust_umap_param1)

# Display the first few rows of the dataset
head(mobius_clust_umap_param1)

UMAP embedding for mobius_clust_data dataset which with noise dimensions UMAP parameters set to n-neigbors: 30 and min-dist: 0.08.

Description

The 'mobius_clust_umap_param2' dataset contains the UMAP (Uniform Manifold Approximation and Projection) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two UMAP coordinates (emb1 and emb2).

Usage

data(mobius_clust_umap_param2)

Format

## 'mobius_clust_umap_param2' A data frame with 500 rows and 2 columns:

emb1

Numeric, first UMAP 2D embeddings.

emb2

Numeric, second UMAP 2D embeddings.

Source

This dataset is generated for illustrative purposes.

Examples

# Load the mobius_clust_umap_param2 dataset
data(mobius_clust_umap_param2)

# Display the first few rows of the dataset
head(mobius_clust_umap_param2)

UMAP embedding for mobius_clust_data dataset which with noise dimensions UMAP parameters set to n-neigbors: 5 and min-dist: 0.9.

Description

The 'mobius_clust_umap_param3' dataset contains the UMAP (Uniform Manifold Approximation and Projection) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two UMAP coordinates (emb1 and emb2).

Usage

data(mobius_clust_umap_param3)

Format

## 'mobius_clust_umap_param3' A data frame with 500 rows and 2 columns:

emb1

Numeric, first UMAP 2D embeddings.

emb2

Numeric, second UMAP 2D embeddings.

Source

This dataset is generated for illustrative purposes.

Examples

# Load the mobius_clust_umap_param3 dataset
data(mobius_clust_umap_param3)

# Display the first few rows of the dataset
head(mobius_clust_umap_param3)

Generate points on a nonlinear 2D manifold

Description

This function generates points on a nonlinear 2D manifold based on a given equation.

Usage

nonlinear_2d(n, num_noise, min_n, max_n)

Arguments

n

The number of points to generate.

num_noise

The number of noise dimensions to add to the generated points.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the generated points on the nonlinear 2D manifold.

Examples

set.seed(20240412)
nonlinear_points <- nonlinear_2d(
  n = 100, num_noise = 2, min_n = -0.01,
  max_n = 0.01
)

Generate Nonlinear Connected Data with Noise

Description

This function generates a dataset representing nonlinear connected clusters with added noise.

Usage

nonlinear_connect(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the nonlinear connected data with noise.

Examples

set.seed(20240412)
nonlinear_connect <- nonlinear_connect(
  n = 400, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Nonlinear Mirror Data with Noise

Description

This function generates a dataset representing two mirror-image clusters with added noise.

Usage

nonlinear_mirror(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the nonlinear mirror data with noise.

Examples

set.seed(20240412)
nonlinear_mirror <- nonlinear_mirror(
  n = 400, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Doublets with Noise

Description

This function generates data with one set of doublets (pairs of clusters) along with added background noise.

Usage

one_doublet(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate doublets with noise with custom parameters
set.seed(20240412)
data <- one_doublet(n = 220, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Doublets with Background Noise

Description

This function generates data with doublets (pairs of clusters) along with added background noise.

Usage

one_doublet_bkg(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate doublets with background noise with custom parameters
set.seed(20240412)
data <- one_doublet_bkg(n = 250, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Doublets with Different Pattern Clusters and Noise

Description

This function generates data with one set of doublets (pairs of clusters) having different patterns, along with added background noise.

Usage

one_doublet_diff_patterns(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate doublets with different pattern clusters and noise with custom parameters
set.seed(20240412)
data <- one_doublet_diff_patterns(
  n = 280, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Doublets with Different Variance Clusters and Noise

Description

This function generates data with one set of doublets (pairs of clusters) having clusters with different variance, along with added background noise.

Usage

one_doublet_diff_var_clust(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate doublets with different variance clusters and noise with custom parameters
set.seed(20240412)
data <- one_doublet_diff_var_clust(
  n = 260, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate Doublets with Four Clusters and Noise

Description

This function generates data with one set of doublets (pairs of clusters) containing four clusters, along with added background noise.

Usage

one_doublet_four_clusts(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate doublets with four clusters and noise with custom parameters
set.seed(20240412)
data <- one_doublet_four_clusts(
  n = 440, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate Grid Data with Noise

Description

This function generates a grid dataset with specified grid points along the x and y axes, and optionally adds noise dimensions.

Usage

one_grid(nx, ny, num_noise, min_n, max_n)

Arguments

nx

The number of grid points along the x axis.

ny

The number of grid points along the y axis.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the grid data with added noise.

Examples

set.seed(20240412)
one_grid <- one_grid(nx = 10, ny = 10, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate One Grid with Different Values and Background Noise

Description

This function generates a grid dataset with different values and background noise.

Usage

one_grid_bkg(n_value, num_noise, min_n, max_n)

Arguments

n_value

The number of grid points along each axis for the grids.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A list containing the one grid datasets with background noise and the sample size.

Examples

set.seed(20240412)
one_grid_bkg <- one_grid_bkg(
  n_value = 10, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate points on a plane in 2D space

Description

This function generates points on a plane in 3D space based on the provided coefficients, intercepts, and ranges for the parameters.

Usage

plane(
  n,
  coef_x1,
  coef_x2,
  coef_y1,
  coef_y2,
  intercept_x,
  intercept_y,
  u_min,
  u_max,
  v_min,
  v_max,
  num_noise,
  min_n,
  max_n
)

Arguments

n

The number of points to generate.

coef_x1

The coefficient of the first parameter in the x-dimension equation.

coef_x2

The coefficient of the second parameter in the x-dimension equation.

coef_y1

The coefficient of the first parameter in the y-dimension equation.

coef_y2

The coefficient of the second parameter in the y-dimension equation.

intercept_x

The intercept for the x-dimension equation.

intercept_y

The intercept for the y-dimension equation.

u_min

The minimum value for the first parameter (u) range.

u_max

The maximum value for the first parameter (u) range.

v_min

The minimum value for the second parameter (v) range.

v_max

The maximum value for the second parameter (v) range.

num_noise

The number of noise dimensions to add to the generated points.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the generated points on the plane.

Examples

set.seed(20240412)
plane_points <- plane(
  n = 100, coef_x1 = 1, coef_x2 = 1,
  coef_y1 = -1, coef_y2 = 1, intercept_x = -10,
  intercept_y = 8, u_min = 10, u_max = 30, v_min = 10, v_max = 20,
  num_noise = 2, min_n = -0.05, max_n = 0.05
)

Generate 2D Plane with Hole and Noise

Description

This function generates a dataset representing a 2D plane with a hole in the middle, with added noise.

Usage

plane_2d_hole(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A list containing the 2D plane data with a hole and the sample size.

Examples

set.seed(20240412)
plane_data <- plane_2d_hole(
  n = 100, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate data points on a Roman surface with optional noise.

Description

This function generates data points on a Roman surface with optional noise.

Usage

roman_surface_3d(n, num_noise, min_n, max_n)

Arguments

n

Total number of data points to generate.

num_noise

Number of additional noise dimensions to add to the data.

min_n

Minimum value for the noise added to the data.

max_n

Maximum value for the noise added to the data.

Value

A matrix containing the generated data points with or without added noise.

Examples

set.seed(20240412)
roman_surface_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate points on a Roman surface in 3D space.

Description

This function generates points on a Roman surface in 3D space.

Usage

roman_surface_3d_row(a = 1)

Arguments

a

Maximum radius of the object.

Value

A matrix containing the generated points on the Roman surface in 3D space.

Examples

set.seed(20240412)
roman_surface_3d_row(a = 1)

Generate S-curve Data

Description

This function generates S-curve data, which is a commonly used dataset for testing and visualizing dimensionality reduction algorithms.

Usage

scurve(n, num_noise, min_n, max_n)

Arguments

n

The number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the generated S-curve data.

Examples

set.seed(20240412)
s_curve_data <- scurve(
  n = 100, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate S-curve Data with a Hole

Description

This function generates S-curve data with a hole by filtering out samples that are not close to a specified anchor point.

Usage

scurve_hole(n, num_noise, min_n, max_n)

Arguments

n

The number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the generated S-curve data with a hole.

Examples

set.seed(20240412)
s_curve_hole_data <- scurve_hole(
  n = 100, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate Seven-Branching Data with Noise

Description

This function generates a dataset representing seven branches with added noise.

Usage

seven_branch(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the seven-branching data with added noise.

Examples

set.seed(20240412)
seven_branching_data <- seven_branch(
  n = 210, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Sine Curve Data with Noise

Description

This function generates a dataset representing a sine curve with added noise.

Usage

sine_curve(n, num_noise, min_n, max_n)

Arguments

n

The number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the sine curve data with noise.

Examples

set.seed(20240412)
sine_curve <- sine_curve(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Coordinates for a Sphere

Description

This function generates the coordinates for a sphere in three-dimensional space.

Usage

sphere(radius, resolution, num_noise, min_n, max_n)

Arguments

radius

The radius of the sphere.

resolution

The number of points used to approximate the surface of the sphere.

num_noise

The number of additional noise dimensions to add to the coordinates.

min_n

The minimum value for the random noise added to the coordinates.

max_n

The maximum value for the random noise added to the coordinates.

Value

A matrix containing the Cartesian coordinates of the points on the sphere.

Examples

# Generate coordinates for a sphere with radius 1 and resolution 20
set.seed(20240412)
sphere(
  radius = 1, resolution = 20, num_noise = 3, min_n = -0.05,
  max_n = 0.05
)

Generate a spiral dataset with optional noise.

Description

This function generates a dataset arranged in a spiral pattern with optional noise.

Usage

spiral_3d(n, num_dims, num_noise, min_n, max_n)

Arguments

n

Total number of data points to generate.

num_dims

Number of effective dimensions for each data point.

num_noise

Number of additional noise dimensions to add to the data.

min_n

Minimum value for the noise added to the data.

max_n

Maximum value for the noise added to the data.

Value

A matrix containing the generated data points with or without added noise.

Examples

set.seed(20240412)
spiral_3d(n = 100, num_dims = 10, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Swiss Roll Data

Description

This function generates data points in the shape of a Swiss roll.

Usage

swiss_roll(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated Swiss roll data points.

Examples

# Generate Swiss roll data with noise with custom parameters
set.seed(20240412)
data <- swiss_roll(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Three Circular Clusters with Noise

Description

This function generates three circular clusters in 4D space with added noise dimensions.

Usage

three_circulars(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the three circular clusters with added noise.

Examples

set.seed(20240412)
circular_clusters_data <- three_circulars(
  n = 300, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate three clusters of data points with optional noise.

Description

This function generates three clusters of data points along with optional noise.

Usage

three_clust_diff_dist(n, num_dims, num_noise, min_n, max_n)

Arguments

n

Total number of data points to generate, should be a multiple of three.

num_dims

Number of dimensions for each data point.

num_noise

Number of additional noise dimensions to add to the data.

min_n

Minimum value for the noise added to the data.

max_n

Maximum value for the noise added to the data.

Value

A matrix containing the generated data points with or without added noise.

Examples

set.seed(20240412)
three_clust_diff_dist(
  n = 150, num_dims = 7, num_noise = 4, min_n = -0.05,
  max_n = 0.05
)

Generate Three Cluster Mirror with Noise

Description

This function generates data with three clusters forming a mirror image, along with added noise.

Usage

three_clust_mirror(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate three cluster mirror with noise with custom parameters
set.seed(20240412)
data <- three_clust_mirror(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Three Different Linear Data with Noise

Description

This function generates a dataset consisting of three different linear patterns with added noise.

Usage

three_diff_linear(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the three different linear data with added noise.

Examples

set.seed(20240412)
three_diff_linear <- three_diff_linear(
  n = 150, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Doublets with Three Clusters and Noise

Description

This function generates data with three sets of doublets (pairs of clusters) along with added background noise.

Usage

three_doublets(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate doublets with three clusters and noise with custom parameters
set.seed(20240412)
data <- three_doublets(
  n = 420, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate Three Grids with Noise

Description

This function generates three grid datasets with noise dimensions.

Usage

three_grid(n_value, num_noise, min_n, max_n)

Arguments

n_value

The number of grid points along the x and y axes for each grid.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A list containing three grid datasets with added noise and the sample size of each dataset.

Examples

set.seed(20240412)
three_grids <- three_grid(
  n_value = 19, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Three Linear Clusters with Noise

Description

This function generates data with three linear clusters, along with added noise.

Usage

three_long_clust(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate three linear clusters with noise with custom parameters
set.seed(20240412)
data <- three_long_clust(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Three Nonlinear Clusters with Noise

Description

This function generates data with three nonlinear clusters, along with added noise.

Usage

three_nonlinear(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate three nonlinear clusters with noise with custom parameters
set.seed(20240412)
data <- three_nonlinear(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate a torus-shaped dataset with optional noise.

Description

This function generates a torus-shaped dataset along with optional noise.

Usage

torus_3d(n, num_noise, min_n, max_n)

Arguments

n

Total number of data points to generate.

num_noise

Number of additional noise dimensions to add to the data.

min_n

Minimum value for the noise added to the data.

max_n

Maximum value for the noise added to the data.

Value

A matrix containing the generated torus-shaped data points with or without added noise.

Examples

set.seed(20240412)
torus_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate a row of data points for a 3D torus.

Description

This function generates a row of data points for a 3D torus with given radii.

Usage

torus_3d_row(radius)

Arguments

radius

A numeric vector containing the radii of the torus, from largest to smallest.

Value

A vector representing a row of data points for the 3D torus.

Examples

set.seed(20240412)
torus_3d_row(c(2, 1))

Generate Tree-like Data with Noise

Description

This function generates a dataset representing a tree-like structure, with added noise.

Usage

tree(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the tree-like data with added noise.

Examples

set.seed(20240412)
tree_data <- tree(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Triangular 3D Datasets with Noise

Description

This function generates triangular 3D datasets with added noise dimensions.

Usage

tri_3d(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the triangular 3D datasets with added noise.

Examples

set.seed(20240412)
triangular_3d_data <- tri_3d(
  n = 100, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate Triangular Plane with Background Noise

Description

This function generates a triangular plane dataset with background noise dimensions.

Usage

tri_plane_bkg(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the triangular plane dataset with background noise.

Examples

set.seed(20240412)
triangular_plane_data <- tri_plane_bkg(
  n = 216,
  num_noise = 2, min_n = -0.05, max_n = 0.05
)

Generate Linked Data

Description

This function generates linked data points.

Usage

two_circulars(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated. Should be a product of two.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated linked data points.

Examples

# Generate linked data with noise with custom parameters
set.seed(20240412)
data <- two_circulars(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Two Curvilinear Data with Noise

Description

This function generates a dataset representing two curvilinear clusters with added noise.

Usage

two_curvilinear(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the two curvilinear data with noise.

Examples

set.seed(20240412)
two_curvilinear <- two_curvilinear(
  n = 250, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Two Curvilinear Clusters with Noise

Description

This function generates data with two curvilinear clusters along with added noise.

Usage

two_curvy(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate two curvilinear clusters with noise with custom parameters
set.seed(20240412)
data <- two_curvy(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Two Curvilinear Differentiated Clusters with Noise

Description

This function generates data with two curvilinear clusters that are differentiated from each other, along with added noise.

Usage

two_curvy_diff_pts(cluster_size_vec, num_noise, min_n, max_n)

Arguments

cluster_size_vec

A vector specifying the number of points in each cluster.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate two curvilinear differentiated clusters with noise with custom parameters
set.seed(20240412)
data <- two_curvy_diff_pts(
  cluster_size_vec = c(50, 100), num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate Two Curvy Pancakes with Noise

Description

This function generates a dataset representing two curvy pancake-shaped clusters with added noise.

Usage

two_curvy_panckakes(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the two curvy pancakes data with noise.

Examples

set.seed(20240412)
two_curvy_panckakes <- two_curvy_panckakes(
  n = 300, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate Two Doublets with Background Noise

Description

This function generates data with two doublets along with added background noise.

Usage

two_doublets_bkg(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate two doublets with background noise with custom parameters
set.seed(20240412)
data <- two_doublets_bkg(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Doublets in Parallel with Noise

Description

This function generates data with two sets of doublets (pairs of clusters) running in parallel, along with added background noise.

Usage

two_doublets_parallel(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate doublets in parallel with noise with custom parameters
set.seed(20240412)
data <- two_doublets_parallel(n = 440, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate Two Grids with Noise

Description

This function generates two grid datasets with noise dimensions.

Usage

two_grid(n_value, num_noise, min_n, max_n)

Arguments

n_value

The number of grid points along the x and y axes for each grid.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A list containing two grid datasets with added noise and the sample size of each dataset.

Examples

set.seed(20240412)
two_grids <- two_grid(n_value = 19, num_noise = 2, min_n = -0.05, max_n = 0.05)

Generate One Grid with Different Offset

Description

This function generates a single grid dataset with a different offset.

Usage

two_grid_comb(n_value, num_noise, min_n, max_n)

Arguments

n_value

The number of grid points along each axis for the grids.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A list containing the grid dataset with different offsets and the sample size.

Examples

set.seed(20240412)
two_grid_comb <- two_grid_comb(
  n_value = 10, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Two Grids with Background Noise

Description

This function generates two grid datasets with background noise.

Usage

two_grid_comb_bkg(n_value, num_noise, min_n, max_n)

Arguments

n_value

The number of grid points along each axis for the grids.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A list containing the two grid datasets with background noise and the sample size.

Examples

set.seed(20240412)
two_grid_comb_bkg <- two_grid_comb_bkg(
  n_value = 10, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate Long Cluster Data

Description

This function generates a dataset consisting of two long clusters with added noise.

Usage

two_long_clust(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate.

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the long cluster data with added noise.

Examples

set.seed(20240412)
long_cluster <- two_long_clust(
  n = 200, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Two Linear Differentiated Clusters with Noise

Description

This function generates data with two linear clusters that are differentiated from each other, along with added noise.

Usage

two_long_clust_diff(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate two linear differentiated clusters with noise with custom parameters
set.seed(20240412)
data <- two_long_clust_diff(
  n = 300, num_noise = 2, min_n = -0.05,
  max_n = 0.05
)

Generate Two Nonlinear Clusters with Noise

Description

This function generates data with two nonlinear clusters along with added noise.

Usage

two_nonlinear(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the generated data, with each row representing a data point.

Examples

# Generate two nonlinear clusters with noise with custom parameters
set.seed(20240412)
data <- two_nonlinear(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.50)

Generate Two S-Curve Data with Noise

Description

This function generates two S-curve data with noise.

Usage

two_scurve_hole(n, num_noise, min_n, max_n)

Arguments

n

The total number of data points to be generated.

num_noise

The number of additional noise dimensions to be generated.

min_n

The minimum value for the noise added to the data points.

max_n

The maximum value for the noise added to the data points.

Value

A matrix containing the two S-curve datasets with added noise.

Examples

# Generate two S-curve data with noise with custom parameters
set.seed(20240412)
data <- two_scurve_hole(
  n = 200, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)

Generate Two S-curve Datasets with Noise

Description

This function generates two S-curve datasets with added noise dimensions.

Usage

two_scurves(n, num_noise, min_n, max_n)

Arguments

n

The total number of samples to generate (should be divisible by 2).

num_noise

The number of additional noise dimensions to add to the data.

min_n

The minimum value for the noise dimensions.

max_n

The maximum value for the noise dimensions.

Value

A matrix containing the combined S-curve datasets with added noise.

Examples

set.seed(20240412)
two_s_curve_data <- two_scurves(
  n = 200, num_noise = 2,
  min_n = -0.05, max_n = 0.05
)