Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers
With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, span different orders of magnitude, contain extreme values, and have an unclear distribution. Here we present ggbreak, an R package that allows users to create broken axes...
Main Authors: | Shuangbin Xu, Meijun Chen, Tingze Feng, Li Zhan, Lang Zhou, Guangchuang Yu |
---|---|
Format: | Article |
Language: | English |
Published: | Frontiers Media S.A. 2021-11-01 |
Series: | Frontiers in Genetics |
Subjects: | axis break, gap plot, long sequential data, outlier, ggplot2 |
Online Access: | https://www.frontiersin.org/articles/10.3389/fgene.2021.774846/full |
_version_ | 1830198735807709184 |
---|---|
author | Shuangbin Xu; Meijun Chen; Tingze Feng; Li Zhan; Lang Zhou; Guangchuang Yu |
author_sort | Shuangbin Xu |
collection | DOAJ |
description | With the rapid increase of large-scale datasets, biomedical data visualization is facing challenges. The data may be large, span different orders of magnitude, contain extreme values, and have an unclear distribution. Here we present ggbreak, an R package that allows users to create broken axes using ggplot2 syntax. It makes effective use of the plotting area when dealing with large datasets (especially long sequential data), data with different magnitudes, and data containing outliers. The ggbreak package increases the available visual space for better presentation of the data and detailed annotation, thus improving our ability to interpret the data. The package is fully compatible with ggplot2, so it is easy to superpose additional layers and to apply scales and themes to adjust the plot using ggplot2 syntax. ggbreak is open-source software released under the Artistic-2.0 license and is freely available on CRAN (https://CRAN.R-project.org/package=ggbreak) and GitHub (https://github.com/YuLab-SMU/ggbreak). |
first_indexed | 2024-12-18T01:59:31Z |
format | Article |
id | doaj.art-a45b3d556daa4fb282376e3dfaf95e7f |
institution | Directory of Open Access Journals |
issn | 1664-8021 |
language | English |
last_indexed | 2024-12-18T01:59:31Z |
publishDate | 2021-11-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Genetics |
title | Use ggbreak to Effectively Utilize Plotting Space to Deal With Large Datasets and Outliers |
topic | axis break; gap plot; long sequential data; outlier; ggplot2 |
url | https://www.frontiersin.org/articles/10.3389/fgene.2021.774846/full |