Title: | Collection of Data Structures |
---|---|
Description: | A collection of simple simulation datasets designed for generating Nonlinear Dimension Reduction representations techniques such as t-distributed Stochastic Neighbor Embedding, and Uniform Manifold Approximation and Projection. These datasets serve as a valuable resource for understanding the reliability of Nonlinear Dimension Reduction representations in various contexts. |
Authors: | Jayani P.G. Lakshika [aut, cre]
|
Maintainer: | Jayani P.G. Lakshika <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.1.1 |
Built: | 2025-02-12 05:59:17 UTC |
Source: | https://github.com/jayanilakshika/cardinalr |
This function generates a cell cycle dataset with added noise dimensions.
cell_cycle(n, num_noise, min_n, max_n)
cell_cycle(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the cell cycle data with added noise.
set.seed(20240412) cell_cycle_data <- cell_cycle( n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) cell_cycle_data <- cell_cycle( n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates clusters with different shapes, including both Gaussian and non-Gaussian clusters.
clust_diff_shapes( n, num_gau_clust, num_non_gau_clust, clust_sd_gau, clust_sd_non_gau, num_dims, a, b )
clust_diff_shapes( n, num_gau_clust, num_non_gau_clust, clust_sd_gau, clust_sd_non_gau, num_dims, a, b )
n |
The total number of data points to be generated. |
num_gau_clust |
The number of Gaussian clusters to generate. |
num_non_gau_clust |
The number of non-Gaussian clusters to generate. |
clust_sd_gau |
The standard deviation for the Gaussian clusters. |
clust_sd_non_gau |
The standard deviation for the non-Gaussian clusters. |
num_dims |
The number of dimensions for the data points. |
a |
The scaling factor for the non-Gaussian cluster shape. |
b |
The translation factor for the non-Gaussian cluster shape. |
A matrix containing the generated clusters with different shapes.
# Generate clusters with default parameters set.seed(20240412) data <- clust_diff_shapes( n = 300, num_gau_clust = 4, num_non_gau_clust = 2, clust_sd_gau = 0.05, clust_sd_non_gau = 0.1, num_dims = 7, a = 2, b = 4 )
# Generate clusters with default parameters set.seed(20240412) data <- clust_diff_shapes( n = 300, num_gau_clust = 4, num_non_gau_clust = 2, clust_sd_gau = 0.05, clust_sd_non_gau = 0.1, num_dims = 7, a = 2, b = 4 )
This function generates clusters with different shapes, including both Gaussian and non-Gaussian clusters, with different numbers of points in each cluster.
clust_diff_shapes_pts( clust_size_vec, num_gau_clust, num_non_gau_clust, clust_sd_gau, clust_sd_non_gau, num_dims, a, b )
clust_diff_shapes_pts( clust_size_vec, num_gau_clust, num_non_gau_clust, clust_sd_gau, clust_sd_non_gau, num_dims, a, b )
clust_size_vec |
A vector specifying the number of points for each cluster. |
num_gau_clust |
The number of Gaussian clusters to generate. |
num_non_gau_clust |
The number of non-Gaussian clusters to generate. |
clust_sd_gau |
The standard deviation for the Gaussian clusters. |
clust_sd_non_gau |
The standard deviation for the non-Gaussian clusters. |
num_dims |
The number of dimensions for the data points. |
a |
The scaling factor for the non-Gaussian cluster shape. |
b |
The translation factor for the non-Gaussian cluster shape. |
A matrix containing the generated clusters with different shapes and different numbers of points.
# Generate clusters with default parameters set.seed(20240412) data <- clust_diff_shapes_pts( clust_size_vec = c(50, 50, 50, 50, 100, 100), num_gau_clust = 4, num_non_gau_clust = 2, clust_sd_gau = 0.05, clust_sd_non_gau = 0.1, num_dims = 7, a = 2, b = 4 )
# Generate clusters with default parameters set.seed(20240412) data <- clust_diff_shapes_pts( clust_size_vec = c(50, 50, 50, 50, 100, 100), num_gau_clust = 4, num_non_gau_clust = 2, clust_sd_gau = 0.05, clust_sd_non_gau = 0.1, num_dims = 7, a = 2, b = 4 )
This function generates data points along a conic spiral curve with optional noise.
conic_spiral_3d(n, num_noise, min_n, max_n)
conic_spiral_3d(n, num_noise, min_n, max_n)
n |
Total number of data points to generate. |
num_noise |
Number of additional noise dimensions to add to the data. |
min_n |
Minimum value for the noise added to the data. |
max_n |
Maximum value for the noise added to the data. |
A matrix containing the generated data points with or without added noise.
set.seed(20240412) conic_spiral_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) conic_spiral_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates points on a conic spiral in 3D space.
conic_spiral_3d_row(a, b, c, w)
conic_spiral_3d_row(a, b, c, w)
a |
Final radius of the cone. |
b |
Height of the object. |
c |
Inner radius. |
w |
Number of spirals. |
A matrix containing the generated points on the conic spiral.
set.seed(20240412) conic_spiral_3d_row(1, 2, 0.5, 3)
set.seed(20240412) conic_spiral_3d_row(1, 2, 0.5, 3)
This function generates a 3D cube along with optional noise.
cube_3d(num_dims, num_noise, min_n, max_n)
cube_3d(num_dims, num_noise, min_n, max_n)
num_dims |
Number of effective dimensions (default is 3 for a 3D cube). |
num_noise |
Number of additional noise dimensions to add to the data. |
min_n |
Minimum value for the noise added to the data. |
max_n |
Maximum value for the noise added to the data. |
A list containing the generated data matrix and the sample size.
set.seed(20240412) cube_3d(num_dims = 3, num_noise = 2, min_n = -0.01, max_n = 0.01)
set.seed(20240412) cube_3d(num_dims = 3, num_noise = 2, min_n = -0.01, max_n = 0.01)
This function generates points on a curvilinear 2D manifold based on a nonlinear equation.
curv_2d(n, num_noise, min_n, max_n)
curv_2d(n, num_noise, min_n, max_n)
n |
The number of points to generate. |
num_noise |
The number of noise dimensions to add to the generated points. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the generated points on the curvilinear 2D manifold.
set.seed(20240412) curvilinear_points <- curv_2d( n = 100, num_noise = 2, min_n = -0.01, max_n = 0.01 )
set.seed(20240412) curvilinear_points <- curv_2d( n = 100, num_noise = 2, min_n = -0.01, max_n = 0.01 )
This function generates data with curvy branching clusters along with added noise.
curvy_branch(n, num_noise, min_n, max_n)
curvy_branch(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate curvy branching clusters with noise with custom parameters set.seed(20240412) data <- curvy_branch(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate curvy branching clusters with noise with custom parameters set.seed(20240412) data <- curvy_branch(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates curvy branching cluster data with three clusters of different shapes.
curvy_branch_clust(n, clust_vec, num_noise, min_n, max_n)
curvy_branch_clust(n, clust_vec, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
clust_vec |
A vector specifying the number of points for each cluster. If not provided, the n is divided equally among the clusters. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate curvy branching cluster data with custom parameters set.seed(20240412) data <- curvy_branch_clust( n = 300, clust_vec = c(100, 150, 50), num_noise = 2, min_n = -0.05, max_n = 0.05 )
# Generate curvy branching cluster data with custom parameters set.seed(20240412) data <- curvy_branch_clust( n = 300, clust_vec = c(100, 150, 50), num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with four clusters, two of which follow a curvilinear pattern and the other two are distributed randomly.
curvy_branch_clust_bkg(n, num_noise, min_n, max_n)
curvy_branch_clust_bkg(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate curvy branching cluster data with background noise with custom parameters set.seed(20240412) data <- curvy_branch_clust_bkg( n = 400, num_noise = 2, min_n = -0.05, max_n = 0.05 )
# Generate curvy branching cluster data with background noise with custom parameters set.seed(20240412) data <- curvy_branch_clust_bkg( n = 400, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a curvy cell cycle dataset with added noise dimensions.
curvy_cycle(n, num_noise, min_n, max_n)
curvy_cycle(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the curvy cell cycle data with added noise.
set.seed(20240412) curvy_cell_cycle_data <- curvy_cycle( n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) curvy_cell_cycle_data <- curvy_cycle( n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a dataset representing a curvy tree structure, with added noise.
curvy_tree(n, num_noise, min_n, max_n)
curvy_tree(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the curvy tree data with added noise.
set.seed(20240412) tree_data <- curvy_tree(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) tree_data <- curvy_tree(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates data points representing small spheres within a larger encompassing sphere and adds noise to the data if specified.
diff_sphere(n, num_noise, min_n, max_n)
diff_sphere(n, num_noise, min_n, max_n)
n |
Total number of data points to generate, should be a multiple of 13. |
num_noise |
Number of additional noise dimensions to add to the data. |
min_n |
Minimum value for the noise added to the data. |
max_n |
Maximum value for the noise added to the data. |
A matrix containing the generated data points with or without added noise.
set.seed(20240412) diff_sphere( n = 390, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) diff_sphere( n = 390, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates points sampled from the Dini surface along with optional noise.
dini_surface_3d(n, num_noise, min_n, max_n)
dini_surface_3d(n, num_noise, min_n, max_n)
n |
Total number of data points to generate. |
num_noise |
Number of additional noise dimensions to add to the data. |
min_n |
Minimum value for the noise added to the data. |
max_n |
Maximum value for the noise added to the data. |
A matrix containing the generated data points with or without added noise.
set.seed(20240412) dini_surface_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) dini_surface_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates points on a Dini's surface.
dini_surface_3d_row(a = 1, b = 1)
dini_surface_3d_row(a = 1, b = 1)
a |
Outer radius of the surface. |
b |
Space between loops. |
A matrix containing the generated points on the surface.
set.seed(20240412) dini_surface_3d_row(a = 1, b = 1)
set.seed(20240412) dini_surface_3d_row(a = 1, b = 1)
This function generates a dataset representing eight branching patterns, with added noise.
eight_branch(n, num_noise, min_n, max_n)
eight_branch(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the eight branching data with added noise.
set.seed(20240412) branching_data <- eight_branch( n = 400, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) branching_data <- eight_branch( n = 400, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a dataset representing four branches with added noise.
four_branch(n, num_noise, min_n, max_n)
four_branch(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the four-branching data with added noise.
set.seed(20240412) four_branching_data <- four_branch( n = 400, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) four_branching_data <- four_branch( n = 400, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a dataset consisting of four different long clusters with added noise.
four_long_clust(n, num_noise, min_n, max_n)
four_long_clust(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the four different long clusters with added noise.
set.seed(20240412) four_diff_long_clusters <- four_long_clust( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) four_diff_long_clusters <- four_long_clust( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with four long clusters along with background noise.
four_long_clust_bkg(n, num_noise, min_n, max_n)
four_long_clust_bkg(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate four long clusters with background noise with custom parameters set.seed(20240412) data <- four_long_clust_bkg( n = 400, num_noise = 4, min_n = -0.05, max_n = 0.05 )
# Generate four long clusters with background noise with custom parameters set.seed(20240412) data <- four_long_clust_bkg( n = 400, num_noise = 4, min_n = -0.05, max_n = 0.05 )
Generate Gaussian Clusters
gau_clust( n, num_clust, mean_matrix, var_vec, num_dims, num_noise, min_n, max_n )
gau_clust( n, num_clust, mean_matrix, var_vec, num_dims, num_noise, min_n, max_n )
n |
The total number of data points to be generated. |
num_clust |
The number of clusters to generate. |
mean_matrix |
A matrix where each row represents the mean vector for a cluster. |
var_vec |
A vector specifying the variance for each cluster. |
num_dims |
The number of effective dimensions for the data points. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
This function generates Gaussian clusters with specified parameters.
A matrix containing the generated Gaussian clusters.
set.seed(20240412) gau_clust( n = 300, num_clust = 5, mean_matrix = rbind( c(1, 0, 0, 0), c(0, 1, 0, 0), c(0, 0, 1, 0), c(0, 0, 0, 1), c(0, 0, 0, 0) ), var_vec = c(0.05, 0.05, 0.05, 0.05, 0.05), num_dims = 4, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) gau_clust( n = 300, num_clust = 5, mean_matrix = rbind( c(1, 0, 0, 0), c(0, 1, 0, 0), c(0, 0, 1, 0), c(0, 0, 0, 1), c(0, 0, 0, 0) ), var_vec = c(0.05, 0.05, 0.05, 0.05, 0.05), num_dims = 4, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates Gaussian clusters with different numbers of points per cluster.
gau_clust_diff( clust_size_vec, num_clust, mean_matrix, var_vec, num_dims, num_noise, min_n, max_n )
gau_clust_diff( clust_size_vec, num_clust, mean_matrix, var_vec, num_dims, num_noise, min_n, max_n )
clust_size_vec |
A vector specifying the number of points in each cluster. |
num_clust |
The number of clusters to generate. |
mean_matrix |
A matrix where each row represents the mean vector for a cluster. |
var_vec |
A vector specifying the variance for each cluster. |
num_dims |
The number of effective dimensions for the data points. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated Gaussian clusters with different points.
# Generate Gaussian clusters with custom parameters set.seed(20240412) data <- gau_clust_diff( clust_size_vec = c(50, 100, 200, 50), num_clust = 4, mean_matrix = rbind( c(1, 0, 0, 0, 0, 0), c(0, 1, 0, 0, 0, 0), c(0, 0, 1, 0, 0, 0), c(0, 0, 0, 1, 0, 0) ), var_vec = c(0.02, 0.05, 0.06, 0.1), num_dims = 6, num_noise = 4, min_n = -0.05, max_n = 0.05 )
# Generate Gaussian clusters with custom parameters set.seed(20240412) data <- gau_clust_diff( clust_size_vec = c(50, 100, 200, 50), num_clust = 4, mean_matrix = rbind( c(1, 0, 0, 0, 0, 0), c(0, 1, 0, 0, 0, 0), c(0, 0, 1, 0, 0, 0), c(0, 0, 0, 1, 0, 0) ), var_vec = c(0.02, 0.05, 0.06, 0.1), num_dims = 6, num_noise = 4, min_n = -0.05, max_n = 0.05 )
This function generates data with two clusters, one following a curvilinear pattern and the other distributed randomly.
gau_curvy_clust(n, clust_size_vec, num_noise, min_n, max_n)
gau_curvy_clust(n, clust_size_vec, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
clust_size_vec |
A vector specifying the number of points for each cluster. If not provided, the n is divided equally between the two clusters. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate cluster and curvilinear data with custom parameters set.seed(20240412) data <- gau_curvy_clust( n = 300, clust_size_vec = c(100, 200), num_noise = 3, min_n = -0.05, max_n = 0.05 )
# Generate cluster and curvilinear data with custom parameters set.seed(20240412) data <- gau_curvy_clust( n = 300, clust_size_vec = c(100, 200), num_noise = 3, min_n = -0.05, max_n = 0.05 )
This function generates data with clusters and curvilinear patterns along with added background noise.
gau_curvy_clust_bkg(n, num_noise, min_n, max_n)
gau_curvy_clust_bkg(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate clusters and curvilinear data with noise with custom parameters set.seed(20240412) data <- gau_curvy_clust_bkg( n = 260, num_noise = 2, min_n = -0.05, max_n = 0.05 )
# Generate clusters and curvilinear data with noise with custom parameters set.seed(20240412) data <- gau_curvy_clust_bkg( n = 260, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates background noise data with specified parameters such as the number of samples, number of dimensions, mean, and standard deviation.
gen_bkg_noise(n, num_dims, mean, sd)
gen_bkg_noise(n, num_dims, mean, sd)
n |
Number of samples to generate. |
num_dims |
Number of dimensions (columns) of the data. |
mean |
Mean of the normal distribution used to generate noise (default is 0). |
sd |
Standard deviation of the normal distribution used to generate noise (default is 1). |
A matrix containing the generated background noise data, with
n
rows and num_dims
columns.
# Generate background noise with custom mean and standard deviation set.seed(20240412) gen_bkg_noise(n = 50, num_dims = 3, mean = 5, sd = 2)
# Generate background noise with custom mean and standard deviation set.seed(20240412) gen_bkg_noise(n = 50, num_dims = 3, mean = 5, sd = 2)
This function generates random noise dimensions to be added to the coordinates of a sphere.
gen_noise_dims(n, num_noise, min_n, max_n)
gen_noise_dims(n, num_noise, min_n, max_n)
n |
The number of observations for which to generate noise dimensions. |
num_noise |
The number of noise dimensions to generate. |
min_n |
The minimum value for the random noise. |
max_n |
The maximum value for the random noise. |
A matrix containing the generated random noise dimensions.
# Generate random noise dimensions with 3 dimensions, minimum value -1, and maximum value 1 set.seed(20240412) gen_noise_dims(n = 50, num_noise = 3, min_n = -0.01, max_n = 0.01)
# Generate random noise dimensions with 3 dimensions, minimum value -1, and maximum value 1 set.seed(20240412) gen_noise_dims(n = 50, num_noise = 3, min_n = -0.01, max_n = 0.01)
This function generates mirror S-curve datasets with added noise dimensions.
mirror_scurves(n, num_noise, min_n, max_n)
mirror_scurves(n, num_noise, min_n, max_n)
n |
The total number of samples to generate (should be divisible by 2). |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the combined mirror S-curve datasets with added noise.
set.seed(20240412) mirror_s_curve_data <- mirror_scurves( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) mirror_s_curve_data <- mirror_scurves( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a dataset representing a 5-dimensional Mobius strip.
mobius_5d(n, num_noise, min_n, max_n)
mobius_5d(n, num_noise, min_n, max_n)
n |
The number of points to generate for the Mobius strip. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the generated Mobius strip.
set.seed(20240412) mobius_data <- mobius_5d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) mobius_data <- mobius_5d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates a single row of data representing a point on a 5-dimensional Mobius strip.
mobius_5d_row()
mobius_5d_row()
A vector containing the coordinates of the point on the Mobius strip.
set.seed(20240412) mobius_row <- mobius_5d_row()
set.seed(20240412) mobius_row <- mobius_5d_row()
This function generates a dataset consisting of a mobius cluster with added noise.
mobius_clust(n, num_noise, min_n, max_n)
mobius_clust(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the mobius cluster with added noise.
mobius_cluster <- mobius_clust( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
mobius_cluster <- mobius_clust( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
The 'mobius_clust_data' dataset contains a 3-dimensional Mobius and Gaussian cluster with added noise dimensions. Each data point is represented by five dimensions (x1 to x5).
data(mobius_clust_data)
data(mobius_clust_data)
A data frame with 500 rows and 5 columns:
High-dimensional coordinates
This dataset is generated for illustrative purposes.
# Load the mobius_clust_data dataset data(mobius_clust_data) # Display the first few rows of the dataset head(mobius_clust_data)
# Load the mobius_clust_data dataset data(mobius_clust_data) # Display the first few rows of the dataset head(mobius_clust_data)
The 'mobius_clust_tsne_param1' dataset contains the tSNE (t-distributed Stochastic Neighbor Embedding) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two tSNE coordinates (emb1 and emb2).
data(mobius_clust_tsne_param1)
data(mobius_clust_tsne_param1)
## 'mobius_clust_tsne_param1' A data frame with 500 rows and 2 columns:
Numeric, first tSNE 2D embeddings.
Numeric, second tSNE 2D embeddings.
This dataset is generated for illustrative purposes.
# Load the mobius_clust_tsne_param1 dataset data(mobius_clust_tsne_param1) # Display the first few rows of the dataset head(mobius_clust_tsne_param1)
# Load the mobius_clust_tsne_param1 dataset data(mobius_clust_tsne_param1) # Display the first few rows of the dataset head(mobius_clust_tsne_param1)
The 'mobius_clust_tsne_param2' dataset contains the tSNE (t-distributed Stochastic Neighbor Embedding) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two tSNE coordinates (emb1 and emb2).
data(mobius_clust_tsne_param2)
data(mobius_clust_tsne_param2)
## 'mobius_clust_tsne_param2' A data frame with 500 rows and 2 columns:
Numeric, first tSNE 2D embeddings.
Numeric, second tSNE 2D embeddings.
This dataset is generated for illustrative purposes.
# Load the mobius_clust_tsne_param2 dataset data(mobius_clust_tsne_param2) # Display the first few rows of the dataset head(mobius_clust_tsne_param2)
# Load the mobius_clust_tsne_param2 dataset data(mobius_clust_tsne_param2) # Display the first few rows of the dataset head(mobius_clust_tsne_param2)
The 'mobius_clust_tsne_param3' dataset contains the tSNE (t-distributed Stochastic Neighbor Embedding) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two tSNE coordinates (emb1 and emb2).
data(mobius_clust_tsne_param3)
data(mobius_clust_tsne_param3)
## 'mobius_clust_tsne_param3' A data frame with 500 rows and 2 columns:
Numeric, first tSNE 2D embeddings.
Numeric, second tSNE 2D embeddings.
This dataset is generated for illustrative purposes.
# Load the mobius_clust_tsne_param1 dataset data(mobius_clust_tsne_param3) # Display the first few rows of the dataset head(mobius_clust_tsne_param3)
# Load the mobius_clust_tsne_param1 dataset data(mobius_clust_tsne_param3) # Display the first few rows of the dataset head(mobius_clust_tsne_param3)
The 'mobius_clust_umap_param1' dataset contains the UMAP (Uniform Manifold Approximation and Projection) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two UMAP coordinates (emb1 and emb2).
data(mobius_clust_umap_param1)
data(mobius_clust_umap_param1)
## 'mobius_clust_umap_param1' A data frame with 500 rows and 2 columns:
Numeric, first UMAP 2D embeddings.
Numeric, second UMAP 2D embeddings.
This dataset is generated for illustrative purposes.
# Load the mobius_clust_umap_param1 dataset data(mobius_clust_umap_param1) # Display the first few rows of the dataset head(mobius_clust_umap_param1)
# Load the mobius_clust_umap_param1 dataset data(mobius_clust_umap_param1) # Display the first few rows of the dataset head(mobius_clust_umap_param1)
The 'mobius_clust_umap_param2' dataset contains the UMAP (Uniform Manifold Approximation and Projection) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two UMAP coordinates (emb1 and emb2).
data(mobius_clust_umap_param2)
data(mobius_clust_umap_param2)
## 'mobius_clust_umap_param2' A data frame with 500 rows and 2 columns:
Numeric, first UMAP 2D embeddings.
Numeric, second UMAP 2D embeddings.
This dataset is generated for illustrative purposes.
# Load the mobius_clust_umap_param2 dataset data(mobius_clust_umap_param2) # Display the first few rows of the dataset head(mobius_clust_umap_param2)
# Load the mobius_clust_umap_param2 dataset data(mobius_clust_umap_param2) # Display the first few rows of the dataset head(mobius_clust_umap_param2)
The 'mobius_clust_umap_param3' dataset contains the UMAP (Uniform Manifold Approximation and Projection) embeddings of a five-dimensional mobius_clust_data. Each data point is represented by two UMAP coordinates (emb1 and emb2).
data(mobius_clust_umap_param3)
data(mobius_clust_umap_param3)
## 'mobius_clust_umap_param3' A data frame with 500 rows and 2 columns:
Numeric, first UMAP 2D embeddings.
Numeric, second UMAP 2D embeddings.
This dataset is generated for illustrative purposes.
# Load the mobius_clust_umap_param3 dataset data(mobius_clust_umap_param3) # Display the first few rows of the dataset head(mobius_clust_umap_param3)
# Load the mobius_clust_umap_param3 dataset data(mobius_clust_umap_param3) # Display the first few rows of the dataset head(mobius_clust_umap_param3)
This function generates points on a nonlinear 2D manifold based on a given equation.
nonlinear_2d(n, num_noise, min_n, max_n)
nonlinear_2d(n, num_noise, min_n, max_n)
n |
The number of points to generate. |
num_noise |
The number of noise dimensions to add to the generated points. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the generated points on the nonlinear 2D manifold.
set.seed(20240412) nonlinear_points <- nonlinear_2d( n = 100, num_noise = 2, min_n = -0.01, max_n = 0.01 )
set.seed(20240412) nonlinear_points <- nonlinear_2d( n = 100, num_noise = 2, min_n = -0.01, max_n = 0.01 )
This function generates a dataset representing nonlinear connected clusters with added noise.
nonlinear_connect(n, num_noise, min_n, max_n)
nonlinear_connect(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the nonlinear connected data with noise.
set.seed(20240412) nonlinear_connect <- nonlinear_connect( n = 400, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) nonlinear_connect <- nonlinear_connect( n = 400, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a dataset representing two mirror-image clusters with added noise.
nonlinear_mirror(n, num_noise, min_n, max_n)
nonlinear_mirror(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the nonlinear mirror data with noise.
set.seed(20240412) nonlinear_mirror <- nonlinear_mirror( n = 400, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) nonlinear_mirror <- nonlinear_mirror( n = 400, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with one set of doublets (pairs of clusters) along with added background noise.
one_doublet(n, num_noise, min_n, max_n)
one_doublet(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate doublets with noise with custom parameters set.seed(20240412) data <- one_doublet(n = 220, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate doublets with noise with custom parameters set.seed(20240412) data <- one_doublet(n = 220, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates data with doublets (pairs of clusters) along with added background noise.
one_doublet_bkg(n, num_noise, min_n, max_n)
one_doublet_bkg(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate doublets with background noise with custom parameters set.seed(20240412) data <- one_doublet_bkg(n = 250, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate doublets with background noise with custom parameters set.seed(20240412) data <- one_doublet_bkg(n = 250, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates data with one set of doublets (pairs of clusters) having different patterns, along with added background noise.
one_doublet_diff_patterns(n, num_noise, min_n, max_n)
one_doublet_diff_patterns(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate doublets with different pattern clusters and noise with custom parameters set.seed(20240412) data <- one_doublet_diff_patterns( n = 280, num_noise = 2, min_n = -0.05, max_n = 0.05 )
# Generate doublets with different pattern clusters and noise with custom parameters set.seed(20240412) data <- one_doublet_diff_patterns( n = 280, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with one set of doublets (pairs of clusters) having clusters with different variance, along with added background noise.
one_doublet_diff_var_clust(n, num_noise, min_n, max_n)
one_doublet_diff_var_clust(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate doublets with different variance clusters and noise with custom parameters set.seed(20240412) data <- one_doublet_diff_var_clust( n = 260, num_noise = 2, min_n = -0.05, max_n = 0.05 )
# Generate doublets with different variance clusters and noise with custom parameters set.seed(20240412) data <- one_doublet_diff_var_clust( n = 260, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with one set of doublets (pairs of clusters) containing four clusters, along with added background noise.
one_doublet_four_clusts(n, num_noise, min_n, max_n)
one_doublet_four_clusts(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate doublets with four clusters and noise with custom parameters set.seed(20240412) data <- one_doublet_four_clusts( n = 440, num_noise = 2, min_n = -0.05, max_n = 0.05 )
# Generate doublets with four clusters and noise with custom parameters set.seed(20240412) data <- one_doublet_four_clusts( n = 440, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a grid dataset with specified grid points along the x and y axes, and optionally adds noise dimensions.
one_grid(nx, ny, num_noise, min_n, max_n)
one_grid(nx, ny, num_noise, min_n, max_n)
nx |
The number of grid points along the x axis. |
ny |
The number of grid points along the y axis. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the grid data with added noise.
set.seed(20240412) one_grid <- one_grid(nx = 10, ny = 10, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) one_grid <- one_grid(nx = 10, ny = 10, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates a grid dataset with different values and background noise.
one_grid_bkg(n_value, num_noise, min_n, max_n)
one_grid_bkg(n_value, num_noise, min_n, max_n)
n_value |
The number of grid points along each axis for the grids. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A list containing the one grid datasets with background noise and the sample size.
set.seed(20240412) one_grid_bkg <- one_grid_bkg( n_value = 10, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) one_grid_bkg <- one_grid_bkg( n_value = 10, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates points on a plane in 3D space based on the provided coefficients, intercepts, and ranges for the parameters.
plane( n, coef_x1, coef_x2, coef_y1, coef_y2, intercept_x, intercept_y, u_min, u_max, v_min, v_max, num_noise, min_n, max_n )
plane( n, coef_x1, coef_x2, coef_y1, coef_y2, intercept_x, intercept_y, u_min, u_max, v_min, v_max, num_noise, min_n, max_n )
n |
The number of points to generate. |
coef_x1 |
The coefficient of the first parameter in the x-dimension equation. |
coef_x2 |
The coefficient of the second parameter in the x-dimension equation. |
coef_y1 |
The coefficient of the first parameter in the y-dimension equation. |
coef_y2 |
The coefficient of the second parameter in the y-dimension equation. |
intercept_x |
The intercept for the x-dimension equation. |
intercept_y |
The intercept for the y-dimension equation. |
u_min |
The minimum value for the first parameter (u) range. |
u_max |
The maximum value for the first parameter (u) range. |
v_min |
The minimum value for the second parameter (v) range. |
v_max |
The maximum value for the second parameter (v) range. |
num_noise |
The number of noise dimensions to add to the generated points. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the generated points on the plane.
set.seed(20240412) plane_points <- plane( n = 100, coef_x1 = 1, coef_x2 = 1, coef_y1 = -1, coef_y2 = 1, intercept_x = -10, intercept_y = 8, u_min = 10, u_max = 30, v_min = 10, v_max = 20, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) plane_points <- plane( n = 100, coef_x1 = 1, coef_x2 = 1, coef_y1 = -1, coef_y2 = 1, intercept_x = -10, intercept_y = 8, u_min = 10, u_max = 30, v_min = 10, v_max = 20, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a dataset representing a 2D plane with a hole in the middle, with added noise.
plane_2d_hole(n, num_noise, min_n, max_n)
plane_2d_hole(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A list containing the 2D plane data with a hole and the sample size.
set.seed(20240412) plane_data <- plane_2d_hole( n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) plane_data <- plane_2d_hole( n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data points on a Roman surface with optional noise.
roman_surface_3d(n, num_noise, min_n, max_n)
roman_surface_3d(n, num_noise, min_n, max_n)
n |
Total number of data points to generate. |
num_noise |
Number of additional noise dimensions to add to the data. |
min_n |
Minimum value for the noise added to the data. |
max_n |
Maximum value for the noise added to the data. |
A matrix containing the generated data points with or without added noise.
set.seed(20240412) roman_surface_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) roman_surface_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates points on a Roman surface in 3D space.
roman_surface_3d_row(a = 1)
roman_surface_3d_row(a = 1)
a |
Maximum radius of the object. |
A matrix containing the generated points on the Roman surface in 3D space.
set.seed(20240412) roman_surface_3d_row(a = 1)
set.seed(20240412) roman_surface_3d_row(a = 1)
This function generates S-curve data, which is a commonly used dataset for testing and visualizing dimensionality reduction algorithms.
scurve(n, num_noise, min_n, max_n)
scurve(n, num_noise, min_n, max_n)
n |
The number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the generated S-curve data.
set.seed(20240412) s_curve_data <- scurve( n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) s_curve_data <- scurve( n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates S-curve data with a hole by filtering out samples that are not close to a specified anchor point.
scurve_hole(n, num_noise, min_n, max_n)
scurve_hole(n, num_noise, min_n, max_n)
n |
The number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the generated S-curve data with a hole.
set.seed(20240412) s_curve_hole_data <- scurve_hole( n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) s_curve_hole_data <- scurve_hole( n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a dataset representing seven branches with added noise.
seven_branch(n, num_noise, min_n, max_n)
seven_branch(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the seven-branching data with added noise.
set.seed(20240412) seven_branching_data <- seven_branch( n = 210, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) seven_branching_data <- seven_branch( n = 210, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a dataset representing a sine curve with added noise.
sine_curve(n, num_noise, min_n, max_n)
sine_curve(n, num_noise, min_n, max_n)
n |
The number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the sine curve data with noise.
set.seed(20240412) sine_curve <- sine_curve(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) sine_curve <- sine_curve(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates the coordinates for a sphere in three-dimensional space.
sphere(radius, resolution, num_noise, min_n, max_n)
sphere(radius, resolution, num_noise, min_n, max_n)
radius |
The radius of the sphere. |
resolution |
The number of points used to approximate the surface of the sphere. |
num_noise |
The number of additional noise dimensions to add to the coordinates. |
min_n |
The minimum value for the random noise added to the coordinates. |
max_n |
The maximum value for the random noise added to the coordinates. |
A matrix containing the Cartesian coordinates of the points on the sphere.
# Generate coordinates for a sphere with radius 1 and resolution 20 set.seed(20240412) sphere( radius = 1, resolution = 20, num_noise = 3, min_n = -0.05, max_n = 0.05 )
# Generate coordinates for a sphere with radius 1 and resolution 20 set.seed(20240412) sphere( radius = 1, resolution = 20, num_noise = 3, min_n = -0.05, max_n = 0.05 )
This function generates a dataset arranged in a spiral pattern with optional noise.
spiral_3d(n, num_dims, num_noise, min_n, max_n)
spiral_3d(n, num_dims, num_noise, min_n, max_n)
n |
Total number of data points to generate. |
num_dims |
Number of effective dimensions for each data point. |
num_noise |
Number of additional noise dimensions to add to the data. |
min_n |
Minimum value for the noise added to the data. |
max_n |
Maximum value for the noise added to the data. |
A matrix containing the generated data points with or without added noise.
set.seed(20240412) spiral_3d(n = 100, num_dims = 10, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) spiral_3d(n = 100, num_dims = 10, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates data points in the shape of a Swiss roll.
swiss_roll(n, num_noise, min_n, max_n)
swiss_roll(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated Swiss roll data points.
# Generate Swiss roll data with noise with custom parameters set.seed(20240412) data <- swiss_roll(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate Swiss roll data with noise with custom parameters set.seed(20240412) data <- swiss_roll(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates three circular clusters in 4D space with added noise dimensions.
three_circulars(n, num_noise, min_n, max_n)
three_circulars(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the three circular clusters with added noise.
set.seed(20240412) circular_clusters_data <- three_circulars( n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) circular_clusters_data <- three_circulars( n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates three clusters of data points along with optional noise.
three_clust_diff_dist(n, num_dims, num_noise, min_n, max_n)
three_clust_diff_dist(n, num_dims, num_noise, min_n, max_n)
n |
Total number of data points to generate, should be a multiple of three. |
num_dims |
Number of dimensions for each data point. |
num_noise |
Number of additional noise dimensions to add to the data. |
min_n |
Minimum value for the noise added to the data. |
max_n |
Maximum value for the noise added to the data. |
A matrix containing the generated data points with or without added noise.
set.seed(20240412) three_clust_diff_dist( n = 150, num_dims = 7, num_noise = 4, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) three_clust_diff_dist( n = 150, num_dims = 7, num_noise = 4, min_n = -0.05, max_n = 0.05 )
This function generates data with three clusters forming a mirror image, along with added noise.
three_clust_mirror(n, num_noise, min_n, max_n)
three_clust_mirror(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate three cluster mirror with noise with custom parameters set.seed(20240412) data <- three_clust_mirror(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate three cluster mirror with noise with custom parameters set.seed(20240412) data <- three_clust_mirror(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates a dataset consisting of three different linear patterns with added noise.
three_diff_linear(n, num_noise, min_n, max_n)
three_diff_linear(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the three different linear data with added noise.
set.seed(20240412) three_diff_linear <- three_diff_linear( n = 150, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) three_diff_linear <- three_diff_linear( n = 150, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with three sets of doublets (pairs of clusters) along with added background noise.
three_doublets(n, num_noise, min_n, max_n)
three_doublets(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate doublets with three clusters and noise with custom parameters set.seed(20240412) data <- three_doublets( n = 420, num_noise = 2, min_n = -0.05, max_n = 0.05 )
# Generate doublets with three clusters and noise with custom parameters set.seed(20240412) data <- three_doublets( n = 420, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates three grid datasets with noise dimensions.
three_grid(n_value, num_noise, min_n, max_n)
three_grid(n_value, num_noise, min_n, max_n)
n_value |
The number of grid points along the x and y axes for each grid. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A list containing three grid datasets with added noise and the sample size of each dataset.
set.seed(20240412) three_grids <- three_grid( n_value = 19, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) three_grids <- three_grid( n_value = 19, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with three linear clusters, along with added noise.
three_long_clust(n, num_noise, min_n, max_n)
three_long_clust(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate three linear clusters with noise with custom parameters set.seed(20240412) data <- three_long_clust(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate three linear clusters with noise with custom parameters set.seed(20240412) data <- three_long_clust(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates data with three nonlinear clusters, along with added noise.
three_nonlinear(n, num_noise, min_n, max_n)
three_nonlinear(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate three nonlinear clusters with noise with custom parameters set.seed(20240412) data <- three_nonlinear(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate three nonlinear clusters with noise with custom parameters set.seed(20240412) data <- three_nonlinear(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates a torus-shaped dataset along with optional noise.
torus_3d(n, num_noise, min_n, max_n)
torus_3d(n, num_noise, min_n, max_n)
n |
Total number of data points to generate. |
num_noise |
Number of additional noise dimensions to add to the data. |
min_n |
Minimum value for the noise added to the data. |
max_n |
Maximum value for the noise added to the data. |
A matrix containing the generated torus-shaped data points with or without added noise.
set.seed(20240412) torus_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) torus_3d(n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates a row of data points for a 3D torus with given radii.
torus_3d_row(radius)
torus_3d_row(radius)
radius |
A numeric vector containing the radii of the torus, from largest to smallest. |
A vector representing a row of data points for the 3D torus.
set.seed(20240412) torus_3d_row(c(2, 1))
set.seed(20240412) torus_3d_row(c(2, 1))
This function generates a dataset representing a tree-like structure, with added noise.
tree(n, num_noise, min_n, max_n)
tree(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the tree-like data with added noise.
set.seed(20240412) tree_data <- tree(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) tree_data <- tree(n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates triangular 3D datasets with added noise dimensions.
tri_3d(n, num_noise, min_n, max_n)
tri_3d(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the triangular 3D datasets with added noise.
set.seed(20240412) triangular_3d_data <- tri_3d( n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) triangular_3d_data <- tri_3d( n = 100, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a triangular plane dataset with background noise dimensions.
tri_plane_bkg(n, num_noise, min_n, max_n)
tri_plane_bkg(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the triangular plane dataset with background noise.
set.seed(20240412) triangular_plane_data <- tri_plane_bkg( n = 216, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) triangular_plane_data <- tri_plane_bkg( n = 216, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates linked data points.
two_circulars(n, num_noise, min_n, max_n)
two_circulars(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. Should be a product of two. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated linked data points.
# Generate linked data with noise with custom parameters set.seed(20240412) data <- two_circulars(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate linked data with noise with custom parameters set.seed(20240412) data <- two_circulars(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates a dataset representing two curvilinear clusters with added noise.
two_curvilinear(n, num_noise, min_n, max_n)
two_curvilinear(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the two curvilinear data with noise.
set.seed(20240412) two_curvilinear <- two_curvilinear( n = 250, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) two_curvilinear <- two_curvilinear( n = 250, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with two curvilinear clusters along with added noise.
two_curvy(n, num_noise, min_n, max_n)
two_curvy(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate two curvilinear clusters with noise with custom parameters set.seed(20240412) data <- two_curvy(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate two curvilinear clusters with noise with custom parameters set.seed(20240412) data <- two_curvy(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates data with two curvilinear clusters that are differentiated from each other, along with added noise.
two_curvy_diff_pts(cluster_size_vec, num_noise, min_n, max_n)
two_curvy_diff_pts(cluster_size_vec, num_noise, min_n, max_n)
cluster_size_vec |
A vector specifying the number of points in each cluster. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate two curvilinear differentiated clusters with noise with custom parameters set.seed(20240412) data <- two_curvy_diff_pts( cluster_size_vec = c(50, 100), num_noise = 2, min_n = -0.05, max_n = 0.05 )
# Generate two curvilinear differentiated clusters with noise with custom parameters set.seed(20240412) data <- two_curvy_diff_pts( cluster_size_vec = c(50, 100), num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a dataset representing two curvy pancake-shaped clusters with added noise.
two_curvy_panckakes(n, num_noise, min_n, max_n)
two_curvy_panckakes(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the two curvy pancakes data with noise.
set.seed(20240412) two_curvy_panckakes <- two_curvy_panckakes( n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) two_curvy_panckakes <- two_curvy_panckakes( n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with two doublets along with added background noise.
two_doublets_bkg(n, num_noise, min_n, max_n)
two_doublets_bkg(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate two doublets with background noise with custom parameters set.seed(20240412) data <- two_doublets_bkg(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate two doublets with background noise with custom parameters set.seed(20240412) data <- two_doublets_bkg(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates data with two sets of doublets (pairs of clusters) running in parallel, along with added background noise.
two_doublets_parallel(n, num_noise, min_n, max_n)
two_doublets_parallel(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate doublets in parallel with noise with custom parameters set.seed(20240412) data <- two_doublets_parallel(n = 440, num_noise = 2, min_n = -0.05, max_n = 0.05)
# Generate doublets in parallel with noise with custom parameters set.seed(20240412) data <- two_doublets_parallel(n = 440, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates two grid datasets with noise dimensions.
two_grid(n_value, num_noise, min_n, max_n)
two_grid(n_value, num_noise, min_n, max_n)
n_value |
The number of grid points along the x and y axes for each grid. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A list containing two grid datasets with added noise and the sample size of each dataset.
set.seed(20240412) two_grids <- two_grid(n_value = 19, num_noise = 2, min_n = -0.05, max_n = 0.05)
set.seed(20240412) two_grids <- two_grid(n_value = 19, num_noise = 2, min_n = -0.05, max_n = 0.05)
This function generates a single grid dataset with a different offset.
two_grid_comb(n_value, num_noise, min_n, max_n)
two_grid_comb(n_value, num_noise, min_n, max_n)
n_value |
The number of grid points along each axis for the grids. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A list containing the grid dataset with different offsets and the sample size.
set.seed(20240412) two_grid_comb <- two_grid_comb( n_value = 10, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) two_grid_comb <- two_grid_comb( n_value = 10, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates two grid datasets with background noise.
two_grid_comb_bkg(n_value, num_noise, min_n, max_n)
two_grid_comb_bkg(n_value, num_noise, min_n, max_n)
n_value |
The number of grid points along each axis for the grids. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A list containing the two grid datasets with background noise and the sample size.
set.seed(20240412) two_grid_comb_bkg <- two_grid_comb_bkg( n_value = 10, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) two_grid_comb_bkg <- two_grid_comb_bkg( n_value = 10, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates a dataset consisting of two long clusters with added noise.
two_long_clust(n, num_noise, min_n, max_n)
two_long_clust(n, num_noise, min_n, max_n)
n |
The total number of samples to generate. |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the long cluster data with added noise.
set.seed(20240412) long_cluster <- two_long_clust( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) long_cluster <- two_long_clust( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with two linear clusters that are differentiated from each other, along with added noise.
two_long_clust_diff(n, num_noise, min_n, max_n)
two_long_clust_diff(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate two linear differentiated clusters with noise with custom parameters set.seed(20240412) data <- two_long_clust_diff( n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05 )
# Generate two linear differentiated clusters with noise with custom parameters set.seed(20240412) data <- two_long_clust_diff( n = 300, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates data with two nonlinear clusters along with added noise.
two_nonlinear(n, num_noise, min_n, max_n)
two_nonlinear(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the generated data, with each row representing a data point.
# Generate two nonlinear clusters with noise with custom parameters set.seed(20240412) data <- two_nonlinear(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.50)
# Generate two nonlinear clusters with noise with custom parameters set.seed(20240412) data <- two_nonlinear(n = 200, num_noise = 2, min_n = -0.05, max_n = 0.50)
This function generates two S-curve data with noise.
two_scurve_hole(n, num_noise, min_n, max_n)
two_scurve_hole(n, num_noise, min_n, max_n)
n |
The total number of data points to be generated. |
num_noise |
The number of additional noise dimensions to be generated. |
min_n |
The minimum value for the noise added to the data points. |
max_n |
The maximum value for the noise added to the data points. |
A matrix containing the two S-curve datasets with added noise.
# Generate two S-curve data with noise with custom parameters set.seed(20240412) data <- two_scurve_hole( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
# Generate two S-curve data with noise with custom parameters set.seed(20240412) data <- two_scurve_hole( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
This function generates two S-curve datasets with added noise dimensions.
two_scurves(n, num_noise, min_n, max_n)
two_scurves(n, num_noise, min_n, max_n)
n |
The total number of samples to generate (should be divisible by 2). |
num_noise |
The number of additional noise dimensions to add to the data. |
min_n |
The minimum value for the noise dimensions. |
max_n |
The maximum value for the noise dimensions. |
A matrix containing the combined S-curve datasets with added noise.
set.seed(20240412) two_s_curve_data <- two_scurves( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )
set.seed(20240412) two_s_curve_data <- two_scurves( n = 200, num_noise = 2, min_n = -0.05, max_n = 0.05 )