Extractive Summarization of Call Transcripts

Automatic text summarization is one of the most challenging and interesting problems in natural language processing (NLP). Text summarization is the process of extracting the most important information from the text and presenting it concisely in fewer sentences. Call transcript involves textual des...

Full description

Bibliographic Details
Main Authors: Pratik K. Biswas, Aleksandr Iakubovich
Format: Article
Language:English
Published: IEEE 2022-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9946852/
_version_ 1797986993744904192
author Pratik K. Biswas
Aleksandr Iakubovich
author_facet Pratik K. Biswas
Aleksandr Iakubovich
author_sort Pratik K. Biswas
collection DOAJ
description Automatic text summarization is one of the most challenging and interesting problems in natural language processing (NLP). Text summarization is the process of extracting the most important information from the text and presenting it concisely in fewer sentences. Call transcript involves textual description of a phone conversation between a customer (caller) and agent(s) (customer representatives). Call transcripts pose unique challenges that are not adequately addressed by most open-source automatic text summarizers, which are developed to summarize continuous texts such as articles and stories. This paper presents an indigenously developed method that combines topic modeling and sentence selection with punctuation restoration in condensing ill-punctuated or un-punctuated call transcripts to produce more readable summaries. This unique combination is what distinguishes the proposed summarizer from other text summarizers. Extensive testing, evaluation and comparisons, with an open-source, state-of-the-art extractive summarizer using three different pre-trained language models, have demonstrated the efficacy of this summarizer for call transcript summarization. The summaries generated by the proposed summarizer are shown to be more compelling and useful based on multiple criteria.
first_indexed 2024-04-11T07:41:50Z
format Article
id doaj.art-7c7d94444a704d3ab888bdf69c13bd16
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-11T07:41:50Z
publishDate 2022-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-7c7d94444a704d3ab888bdf69c13bd162022-12-22T04:36:30ZengIEEEIEEE Access2169-35362022-01-011011982611984010.1109/ACCESS.2022.32214049946852Extractive Summarization of Call TranscriptsPratik K. Biswas0https://orcid.org/0000-0001-8570-3108Aleksandr Iakubovich1https://orcid.org/0000-0002-2287-8039Artificial Intelligence and Data, Global Network and Technology (GNT), Verizon Communications, Basking Ridge, NJ, USACore Engineering and Operations, Global Network and Technology (GNT), Verizon Communications, Richardson, TX, USAAutomatic text summarization is one of the most challenging and interesting problems in natural language processing (NLP). Text summarization is the process of extracting the most important information from the text and presenting it concisely in fewer sentences. Call transcript involves textual description of a phone conversation between a customer (caller) and agent(s) (customer representatives). Call transcripts pose unique challenges that are not adequately addressed by most open-source automatic text summarizers, which are developed to summarize continuous texts such as articles and stories. This paper presents an indigenously developed method that combines topic modeling and sentence selection with punctuation restoration in condensing ill-punctuated or un-punctuated call transcripts to produce more readable summaries. This unique combination is what distinguishes the proposed summarizer from other text summarizers. Extensive testing, evaluation and comparisons, with an open-source, state-of-the-art extractive summarizer using three different pre-trained language models, have demonstrated the efficacy of this summarizer for call transcript summarization. The summaries generated by the proposed summarizer are shown to be more compelling and useful based on multiple criteria.https://ieeexplore.ieee.org/document/9946852/Extractive summarizationtopic modelstransformersembeddingpunctuation restoration
spellingShingle Pratik K. Biswas
Aleksandr Iakubovich
Extractive Summarization of Call Transcripts
IEEE Access
Extractive summarization
topic models
transformers
embedding
punctuation restoration
title Extractive Summarization of Call Transcripts
title_full Extractive Summarization of Call Transcripts
title_fullStr Extractive Summarization of Call Transcripts
title_full_unstemmed Extractive Summarization of Call Transcripts
title_short Extractive Summarization of Call Transcripts
title_sort extractive summarization of call transcripts
topic Extractive summarization
topic models
transformers
embedding
punctuation restoration
url https://ieeexplore.ieee.org/document/9946852/
work_keys_str_mv AT pratikkbiswas extractivesummarizationofcalltranscripts
AT aleksandriakubovich extractivesummarizationofcalltranscripts