Large scale genomic rearrangements in selected Arabidopsis thaliana T-DNA lines are caused by T-DNA insertion mutagenesis
Abstract Background Experimental proof of gene function assignments in plants is based on mutant analyses. T-DNA insertion lines provided an invaluable resource of mutants and enabled systematic reverse genetics-based investigation of the functions of Arabidopsis thaliana genes during the last decad...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-08-01
|
Series: | BMC Genomics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12864-021-07877-8 |
_version_ | 1818607221080064000 |
---|---|
author | Boas Pucker Nils Kleinbölting Bernd Weisshaar |
author_facet | Boas Pucker Nils Kleinbölting Bernd Weisshaar |
author_sort | Boas Pucker |
collection | DOAJ |
description | Abstract Background Experimental proof of gene function assignments in plants is based on mutant analyses. T-DNA insertion lines provided an invaluable resource of mutants and enabled systematic reverse genetics-based investigation of the functions of Arabidopsis thaliana genes during the last decades. Results We sequenced the genomes of 14 A. thaliana GABI-Kat T-DNA insertion lines, which eluded flanking sequence tag-based attempts to characterize their insertion loci, with Oxford Nanopore Technologies (ONT) long reads. Complex T-DNA insertions were resolved and 11 previously unknown T-DNA loci identified, resulting in about 2 T-DNA insertions per line and suggesting that this number was previously underestimated. T-DNA mutagenesis caused fusions of chromosomes along with compensating translocations to keep the gene set complete throughout meiosis. Also, an inverted duplication of 800 kbp was detected. About 10 % of GABI-Kat lines might be affected by chromosomal rearrangements, some of which do not involve T-DNA. Local assembly of selected reads was shown to be a computationally effective method to resolve the structure of T-DNA insertion loci. We developed an automated workflow to support investigation of long read data from T-DNA insertion lines. All steps from DNA extraction to assembly of T-DNA loci can be completed within days. Conclusions Long read sequencing was demonstrated to be an effective way to resolve complex T-DNA insertions and chromosome fusions. Many T-DNA insertions comprise not just a single T-DNA, but complex arrays of multiple T-DNAs. It is becoming obvious that T-DNA insertion alleles must be characterized by exact identification of both T-DNA::genome junctions to generate clear genotype-to-phenotype relations. |
first_indexed | 2024-12-16T14:23:18Z |
format | Article |
id | doaj.art-4956c9e56d1f45f6a9ba113d733bcfaf |
institution | Directory Open Access Journal |
issn | 1471-2164 |
language | English |
last_indexed | 2024-12-16T14:23:18Z |
publishDate | 2021-08-01 |
publisher | BMC |
record_format | Article |
series | BMC Genomics |
spelling | doaj.art-4956c9e56d1f45f6a9ba113d733bcfaf2022-12-21T22:28:26ZengBMCBMC Genomics1471-21642021-08-0122112110.1186/s12864-021-07877-8Large scale genomic rearrangements in selected Arabidopsis thaliana T-DNA lines are caused by T-DNA insertion mutagenesisBoas Pucker0Nils Kleinbölting1Bernd Weisshaar2Genetics and Genomics of Plants, Center for Biotechnology (CeBiTec), Bielefeld UniversityBioinformatics Resource Facility, Center for Biotechnology (CeBiTec, Bielefeld UniversityGenetics and Genomics of Plants, Center for Biotechnology (CeBiTec), Bielefeld UniversityAbstract Background Experimental proof of gene function assignments in plants is based on mutant analyses. T-DNA insertion lines provided an invaluable resource of mutants and enabled systematic reverse genetics-based investigation of the functions of Arabidopsis thaliana genes during the last decades. Results We sequenced the genomes of 14 A. thaliana GABI-Kat T-DNA insertion lines, which eluded flanking sequence tag-based attempts to characterize their insertion loci, with Oxford Nanopore Technologies (ONT) long reads. Complex T-DNA insertions were resolved and 11 previously unknown T-DNA loci identified, resulting in about 2 T-DNA insertions per line and suggesting that this number was previously underestimated. T-DNA mutagenesis caused fusions of chromosomes along with compensating translocations to keep the gene set complete throughout meiosis. Also, an inverted duplication of 800 kbp was detected. About 10 % of GABI-Kat lines might be affected by chromosomal rearrangements, some of which do not involve T-DNA. Local assembly of selected reads was shown to be a computationally effective method to resolve the structure of T-DNA insertion loci. We developed an automated workflow to support investigation of long read data from T-DNA insertion lines. All steps from DNA extraction to assembly of T-DNA loci can be completed within days. Conclusions Long read sequencing was demonstrated to be an effective way to resolve complex T-DNA insertions and chromosome fusions. Many T-DNA insertions comprise not just a single T-DNA, but complex arrays of multiple T-DNAs. It is becoming obvious that T-DNA insertion alleles must be characterized by exact identification of both T-DNA::genome junctions to generate clear genotype-to-phenotype relations.https://doi.org/10.1186/s12864-021-07877-8Long read sequencingGenome assemblyStructural variantsTranslocationsChromosome fusionsReverse genetics |
spellingShingle | Boas Pucker Nils Kleinbölting Bernd Weisshaar Large scale genomic rearrangements in selected Arabidopsis thaliana T-DNA lines are caused by T-DNA insertion mutagenesis BMC Genomics Long read sequencing Genome assembly Structural variants Translocations Chromosome fusions Reverse genetics |
title | Large scale genomic rearrangements in selected Arabidopsis thaliana T-DNA lines are caused by T-DNA insertion mutagenesis |
title_full | Large scale genomic rearrangements in selected Arabidopsis thaliana T-DNA lines are caused by T-DNA insertion mutagenesis |
title_fullStr | Large scale genomic rearrangements in selected Arabidopsis thaliana T-DNA lines are caused by T-DNA insertion mutagenesis |
title_full_unstemmed | Large scale genomic rearrangements in selected Arabidopsis thaliana T-DNA lines are caused by T-DNA insertion mutagenesis |
title_short | Large scale genomic rearrangements in selected Arabidopsis thaliana T-DNA lines are caused by T-DNA insertion mutagenesis |
title_sort | large scale genomic rearrangements in selected arabidopsis thaliana t dna lines are caused by t dna insertion mutagenesis |
topic | Long read sequencing Genome assembly Structural variants Translocations Chromosome fusions Reverse genetics |
url | https://doi.org/10.1186/s12864-021-07877-8 |
work_keys_str_mv | AT boaspucker largescalegenomicrearrangementsinselectedarabidopsisthalianatdnalinesarecausedbytdnainsertionmutagenesis AT nilskleinbolting largescalegenomicrearrangementsinselectedarabidopsisthalianatdnalinesarecausedbytdnainsertionmutagenesis AT berndweisshaar largescalegenomicrearrangementsinselectedarabidopsisthalianatdnalinesarecausedbytdnainsertionmutagenesis |