Documentation can be found at documentation
folder. This is in R documentation format and can be easily accessed through RStudio.
Detailed documentation as follows:
This function is used to generate the design matrix from a the given categorical observation.
Binary(MultiLevel, N)
Argument | Description |
---|---|
MultiLevel |
refers to the categorical observations |
N |
vector indicating the degree of freedom for each categorical variable |
MultiLevel
: Transform all the variables into categorical variables with a upper bound for degree of freedom
Only a common upper bound MaxGroup for the degree of freedom is required for the data transformation.
MultiLevel(data, MaxGroup, size)
Argument | Description |
---|---|
data |
n by p data matrix |
MaxGroup |
the common upper bound for the number of levels |
This is the main function that implement our algorithm.
netmat(A, width)
Argument | Description |
---|---|
A |
a binary matrix representing the transfomred data |
width |
a vector indicating which colomns are belonging to the same categorical variable |
This function uses dynamic programming algorithm to transform the zero-inflated continuous variable into categorical variable that minimize the squared Euclidean distance.
CuttingPoint(x, N)
Argument | Description |
---|---|
x |
vector x representing the observations on continuous variable to be transformed. |
N |
N representing the specified number of the levels |