Efficient Latent Space Compression for Lightning-Fast Fine-Tuning and Inference of Transformer-Based Models

This paper presents a technique to reduce the number of parameters in a transformer-based encoder–decoder architecture by incorporating autoencoders. To discover the optimal compression, we trained different autoencoders on the embedding space (encoder’s output) of several pre-trained models. The ex...
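The abstract's core idea, training an autoencoder on a pre-trained encoder's output embeddings to obtain a smaller latent representation, can be sketched as follows. This is a minimal illustration, not the paper's implementation: the dimensions (768-dim embeddings, 128-dim latent), the single-layer architecture, and the random placeholder tensor standing in for real encoder outputs are all assumptions.

```python
import torch
from torch import nn

class EmbeddingAutoencoder(nn.Module):
    """Toy autoencoder that compresses encoder-output embeddings.

    d_model and d_latent are illustrative values, not the paper's settings.
    """
    def __init__(self, d_model=768, d_latent=128):
        super().__init__()
        self.encoder = nn.Linear(d_model, d_latent)  # compress to latent space
        self.decoder = nn.Linear(d_latent, d_model)  # reconstruct embeddings

    def forward(self, x):
        z = self.encoder(x)
        return self.decoder(z), z

# Placeholder for real pre-trained encoder outputs: (batch, seq_len, d_model).
embeddings = torch.randn(8, 32, 768)

model = EmbeddingAutoencoder()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

# Brief reconstruction-loss training loop, for illustration only.
for _ in range(5):
    recon, latent = model(embeddings)
    loss = loss_fn(recon, embeddings)
    opt.zero_grad()
    loss.backward()
    opt.step()

print(latent.shape)  # compressed representation, e.g. torch.Size([8, 32, 128])
```

A downstream decoder fine-tuned on the 128-dim latent vectors instead of the full 768-dim embeddings would then operate on a much smaller input, which is the source of the parameter reduction the abstract describes.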


Bibliographic Details
Main Authors: Ala Alam Falaki, Robin Gras
Format: Article
Language: English
Published: MDPI AG, 2023-07-01
Series: Machine Learning and Knowledge Extraction
Online Access: https://www.mdpi.com/2504-4990/5/3/45
