Data irregularities in discretisation of test sets used for evaluation of classification systems: A case study on authorship attribution

When patterns to be recognised are described by features of continuous type, discretisation becomes either an optional or necessary step in the initial data pre-processing stage. Characteristics of data, distribution of data points in the input space, can significantly influence the process of trans...

Full description

Bibliographic Details
Main Authors: Urszula Stańczyk, Beata Zielosko
Format: Article
Language:English
Published: Polish Academy of Sciences 2021-06-01
Series:Bulletin of the Polish Academy of Sciences: Technical Sciences
Subjects:
Online Access:https://journals.pan.pl/Content/119904/PDF/17_01628_Bpast.No.69(4)_27.08.21_druk.pdf