kamila: Clustering Mixed-Type Data in R and Hadoop
In this paper we discuss the challenge of equitably combining continuous (quantitative) and categorical (qualitative) variables for the purpose of cluster analysis. Existing techniques require strong parametric assumptions, or difficult-to-specify tuning parameters. We describe the kamila package, w...
Main Authors: | Alexander H. Foss, Marianthi Markatou |
---|---|
Format: | Article |
Language: | English |
Published: |
Foundation for Open Access Statistics
2018-02-01
|
Series: | Journal of Statistical Software |
Subjects: | |
Online Access: | https://www.jstatsoft.org/index.php/jss/article/view/2812 |
Similar Items
-
LINEAR REGRESSION WITH R AND HADOOP
by: Bogdan OANCEA
Published: (2015-07-01) -
ESTIMATION OF SEMIPARAMETRIC REGRESSION CURVE WITH MIXED ESTIMATOR OF MULTIVARIABLE LINEAR TRUNCATED SPLINE AND MULTIVARIABLE KERNEL
by: Hesikumalasari Hesikumalasari, et al.
Published: (2022-07-01) -
A Novel Clustering Technique for Efficient Clustering of Big Data in Hadoop Ecosystem
by: Sunil Kumar, et al.
Published: (2019-12-01) -
Integrating R and Hadoop for Big Data Analysis
by: Bogdan Oancea, et al.
Published: (2014-06-01) -
mixtools: An R Package for Analyzing Mixture Models
by: Tatiana Benaglia, et al.
Published: (2009-10-01)