A Novel Topic Segmentation Approach for Enhanced Dialogue Summarization

A Novel Topic Segmentation Approach for Enhanced Dialogue Summarization

Authors

  • Zheng Ren College of Computing, Georgia Institute of Technology, North Avenue, Atlanta, GA 30332

DOI:

https://doi.org/10.53469/wjimt.2024.07(04).05

Keywords:

Dialogue Summarization, Topic Segmentation, Natural Language Processing

Abstract

Dialogue summarization aims to distill a given conversation into a brief and focused summary. The challenge lies in the diverse perspectives of participants and the frequent shifts in topics throughout the dialogue. These factors make it difficult to extract and highlight the most significant information effectively. To tackle this challenge, we introduce a novel topic segmentation method that assigns distinct topics to dialogue segments while accounting for their importance and influence within the entire conversation. Our method's performance has been validated on two public benchmark datasets, CSDS and SAMSUM, demonstrating significant improvements in accuracy and coherence. The results show that our approach not only captures the essential content of dialogues more precisely but also enhances the overall quality and coverage of the summaries. This work provides a fresh approach to dialogue summarization and highlights its potential for practical application.

References

"[1]T. Berg-Kirkpatrick, D. Gillick, and D. Klein, “Jointly learning to extract and compress,” in Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (D. Lin, Y. Matsumoto, and R. Mihalcea, eds.), (Portland, Oregon, USA), pp. 481–490, Association for Computational Linguistics, June 2011.

K. Filippova, E. Alfonseca, C. A. Colmenares, L. Kaiser, and O. Vinyals, “Sentence compression by deletion with LSTMs,” in Proceedings of the 2015 Conferenceon Empirical Methods in Natural Language Processing (L. M`arquez, C. CallisonBurch, and J. Su, eds.), (Lisbon, Portugal), pp. 360–368, Association for Computational Linguistics, Sept. 2015.

H. Jing and K. R. McKeown, “Cut and paste based text summarization,” in 1st Meeting of the North American Chapter of the Association for Computational Linguistics, 2000.

M. Banko, V. O. Mittal, and M. J. Witbrock, “Headline generation based on statistical translation,” in Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, (Hong Kong), pp. 318–325, Association for Computational Linguistics, Oct. 2000.

S. Chopra, M. Auli, and A. M. Rush, “Abstractive sentence summarization with attentive recurrent neural networks,” in Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (K. Knight, A. Nenkova, and O. Rambow, eds.), (San Diego, California), pp. 93–98, Association for Computational Linguistics, June 2016.

R. Nallapati, B. Zhou, C. dos Santos, C¸ . Gul¸cehre, and B. Xiang, “Abstractive text summarization using sequence-to-sequence RNNs and beyond,” in Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning (S. Riezler and Y. Goldberg, eds.), (Berlin, Germany), pp. 280–290, Association for Computational Linguistics, Aug. 2016.

H. Lin, L. Ma, J. Zhu, L. Xiang, Y. Zhou, J. Zhang, and C. Zong, “CSDS: A finegrained chinese dataset for customer service dialogue summarization,” in Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021, pp. 4436–4451, 2021.

Y. Zou, L. Zhao, Y. Kang, J. Lin, M. Peng, Z. Jiang, C. Sun, Q. Zhang, X. Huang, and X. Liu, “Topic-oriented spoken dialogue summarization for customer service with saliency-aware topic modeling,” vol. 35, pp. 14665–14673, May 2021.

C. Liu, P. Wang, X. Jiang, Z. Li, and J. Ye, “Automatic dialogue summary generation for customer service,” pp. 1957–1965, 07 2019.

B. Gliwa, I. Mochol, M. Biesek, and A. Wawer, “SAMSum corpus: A humanannotated dialogue dataset for abstractive summarization,” in Proceedings of the 2nd Workshop on New Frontiers in Summarization, pp. 70–79, 2019.

X. Feng, X. Feng, L. Qin, B. Qin, and T. Liu, “Language model as an annotator: Exploring DialoGPT for dialogue summarization,” (Online), pp. 1479–1491,Association for Computational Linguistics, Aug. 2021.

L. Zhao, F. Zheng, W. Zeng, K. He, G. Ruotong, H. Jiang, W. Wu, and W. Xu,“Adpl: Adversarial prompt-based domain adaptation for dialogue summarization with knowledge disentanglement,” 07 2022.

Y. Chen, Y. Liu, L. Chen, and Y. Zhang, “Dialogsum: A real-life scenario dialogue summarization dataset,” pp. 5062–5074, 01 2021.

Y. Song, Y. Tian, N. Wang, and F. Xia, “Summarizing medical conversations via identifying important utterances,” (Barcelona, Spain (Online)), pp. 717–729, International Committee on Computational Linguistics, Dec. 2020.

K. Krishna, S. Khosla, J. Bigham, and Z. C. Lipton, “Generating SOAP notes from doctor-patient conversations using modular summarization techniques,” (Online), pp. 4958–4972, Association for Computational Linguistics, Aug. 2021.

Y. Zhang, A. Ni, T. Yu, R. Zhang, C. Zhu, B. Deb, A. Celikyilmaz, A. H. Awadallah, and D. Radev, “An exploratory study on long dialogue summarization: What works and what’s next,” (Punta Cana, Dominican Republic), pp. 4426–4433, Association for Computational Linguistics, Nov. 2021.

H. Lin, J. Zhu, L. Xiang, F. Zhai, Y. Zhou, J. Zhang, and C. Zong, “Topic-oriented dialogue summarization,” vol. 31, pp. 1797–1810, 2023.

Lyu, H., Wang, Z., & Babakhani, A. (2020). A UHF/UWB hybrid RFID tag with a 51-m energy-harvesting sensitivity for remote vital-sign monitoring. IEEE transactions on microwave theory and techniques, 68(11), 4886-4895.

Lin, Z., Wang, Z., Zhu, Y., Li, Z., & Qin, H. (2024). Text Sentiment Detection and Classification Based on Integrated Learning Algorithm. Applied Science and Engineering Journal for Advanced Research, 3(3), 27-33.

Wang, Z., Zhu, Y., Li, Z., Wang, Z., Qin, H., & Liu, X. (2024). Graph neural network recommendation system for football formation. Applied Science and Biotechnology Journal for Advanced Research, 3(3), 33-39.

Zhu, Z., Wang, Z., Wu, Z., Zhang, Y., & Bo, S. (2024). Adversarial for Sequential Recommendation Walking in the Multi-Latent Space. Applied Science and Biotechnology Journal for Advanced Research, 3(4), 1-9.

Wang, Z., Zhu, Y., He, S., Yan, H., & Zhu, Z. (2024). LLM for Sentiment Analysis in E-commerce: A Deep Dive into Customer Feedback. Applied Science and Engineering Journal for Advanced Research, 3(4), 8-13.

M. Khalifa, M. Ballesteros, and K. McKeown, “A bag of tricks for dialogue summarization,” (Online and Punta Cana, Dominican Republic), pp. 8014–8022, Association for Computational Linguistics, Nov. 2021.

X. Zhang, R. Zhang, M. Zaheer, and A. Ahmed, “Unsupervised abstractive dialogue summarization for tete-a-tetes,” vol. 35, pp. 14489–14497, May 2021.

C.-Y. Lin and E. Hovy, “Manual and automatic evaluation of summaries,” in Proceedings of the ACL-02 Workshop on Automatic Summarization, pp. 45–51, 2002.

K. Papineni, S. Roukos, T. Ward, and W.J. Zhu, “Bleu: a method for automatic evaluation of machine translation,” in Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 311–318, 2002.

T. Zhang, V. Kishore, F. Wu, K. Q. Weinberger, and Y. Artzi, “Bertscore: Evaluating text generation with BERT,” in 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020, 2020.

W. Zhao, M. Peyrard, F. Liu, Y. Gao, C. M. Meyer, and S. Eger, “MoverScore: Text generation evaluating with contextualized embeddings and earth mover distance,” in Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 563–578, 2019.

Ren, Z. (2024). VGCN: An Enhanced Graph Convolutional Network Model for Text Classification. Journal of Industrial Engineering and Applied Science, 2(4), 110–115. https://doi.org/10.5281/zenodo.13118430

Yuan, B., & Song, T. (2023, November). Structural Resilience and Connectivity of the IPv6 Internet: An AS-level Topology Examination. In Proceedings of the 4th International Conference on Artificial Intelligence and Computer Engineering (pp. 853-856).

Yuan, B., Song, T., & Yao, J. (2024, January). Identification of important nodes in the information propagation network based on the artificial intelligence method. In 2024 4th International Conference on Consumer Electronics and Computer Engineering (ICCECE) (pp. 11-14). IEEE.

Yuan, B. (2024). Design of an Intelligent Dialogue System Based on Natural Language Processing. Journal of Theory and Practice of Engineering Science, 4(01), 72-78.

Wang, Z. (2024, August). CausalBench: A Comprehensive Benchmark for Evaluating Causal Reasoning Capabilities of Large Language Models. In Proceedings of the 10th SIGHAN Workshop on Chinese Language Processing (SIGHAN-10) (pp. 143-151).

Ren, Z. (2024). Enhanced YOLOv8 Infrared Image Object Detection Method with SPD Module. Journal of Theory and Practice in Engineering and Technology, 1(2), 1–7. Retrieved from https://woodyinternational.com/index.php/jtpet/article/view/42 "

Ren, Z. (2024). Enhanced YOLOv8 Infrared Image Object Detection Method with SPD Module. Journal of Theory and Practice in Engineering and Technology, 1(2), 1–7. Retrieved from https://woodyinternational.com/index.php/jtpet/article/view/42

Downloads

Published

2024-08-20
Loading...