Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers

With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, have different orders of magnitude, contain extreme values, and the data distribution is not clear. Here we present an R package ggbreak that allows users to create broken axes...

Full description

Bibliographic Details
Main Authors: Shuangbin Xu, Meijun Chen, Tingze Feng, Li Zhan, Lang Zhou, Guangchuang Yu
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-11-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fgene.2021.774846/full
_version_ 1830198735807709184
author Shuangbin Xu
Meijun Chen
Tingze Feng
Li Zhan
Lang Zhou
Guangchuang Yu
author_facet Shuangbin Xu
Meijun Chen
Tingze Feng
Li Zhan
Lang Zhou
Guangchuang Yu
author_sort Shuangbin Xu
collection DOAJ
description With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, have different orders of magnitude, contain extreme values, and the data distribution is not clear. Here we present an R package ggbreak that allows users to create broken axes using ggplot2 syntax. It can effectively use the plotting area to deal with large datasets (especially for long sequential data), data with different magnitudes, and contain outliers. The ggbreak package increases the available visual space for a better presentation of the data and detailed annotation, thus improves our ability to interpret the data. The ggbreak package is fully compatible with ggplot2 and it is easy to superpose additional layers and applies scale and theme to adjust the plot using the ggplot2 syntax. The ggbreak package is open-source software released under the Artistic-2.0 license, and it is freely available on CRAN (https://CRAN.R-project.org/package=ggbreak) and Github (https://github.com/YuLab-SMU/ggbreak).
first_indexed 2024-12-18T01:59:31Z
format Article
id doaj.art-a45b3d556daa4fb282376e3dfaf95e7f
institution Directory Open Access Journal
issn 1664-8021
language English
last_indexed 2024-12-18T01:59:31Z
publishDate 2021-11-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Genetics
spelling doaj.art-a45b3d556daa4fb282376e3dfaf95e7f2022-12-21T21:24:48ZengFrontiers Media S.A.Frontiers in Genetics1664-80212021-11-011210.3389/fgene.2021.774846774846Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and OutliersShuangbin XuMeijun ChenTingze FengLi ZhanLang ZhouGuangchuang YuWith the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, have different orders of magnitude, contain extreme values, and the data distribution is not clear. Here we present an R package ggbreak that allows users to create broken axes using ggplot2 syntax. It can effectively use the plotting area to deal with large datasets (especially for long sequential data), data with different magnitudes, and contain outliers. The ggbreak package increases the available visual space for a better presentation of the data and detailed annotation, thus improves our ability to interpret the data. The ggbreak package is fully compatible with ggplot2 and it is easy to superpose additional layers and applies scale and theme to adjust the plot using the ggplot2 syntax. The ggbreak package is open-source software released under the Artistic-2.0 license, and it is freely available on CRAN (https://CRAN.R-project.org/package=ggbreak) and Github (https://github.com/YuLab-SMU/ggbreak).https://www.frontiersin.org/articles/10.3389/fgene.2021.774846/fullaxis breakgap plotlong sequential dataoutlierggplot2
spellingShingle Shuangbin Xu
Meijun Chen
Tingze Feng
Li Zhan
Lang Zhou
Guangchuang Yu
Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
Frontiers in Genetics
axis break
gap plot
long sequential data
outlier
ggplot2
title Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title_full Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title_fullStr Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title_full_unstemmed Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title_short Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
title_sort use ggbreak to effectively utilize plotting space to deal with large datasets and outliers
topic axis break
gap plot
long sequential data
outlier
ggplot2
url https://www.frontiersin.org/articles/10.3389/fgene.2021.774846/full
work_keys_str_mv AT shuangbinxu useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers
AT meijunchen useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers
AT tingzefeng useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers
AT lizhan useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers
AT langzhou useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers
AT guangchuangyu useggbreaktoeffectivelyutilizeplottingspacetodealwithlargedatasetsandoutliers