Skip to main content

Advertisement

Log in

A Game Theory Approach for Multi-document Summarization

  • Research Article - Computer Engineering and Computer Science
  • Published:
Arabian Journal for Science and Engineering Aims and scope Submit manuscript

Abstract

In today’s era, information has been growing exponentially on the web, due to which extraction of relevant and concise information has become a challenging task. To overcome the above problem, a fundamental tool known as summarization techniques has been used for understanding and organizing such large datasets. Recently, researchers have been devoting a lot of effort to develop semantics-based models, so as to improve summarization performance. In this paper, a versatile and principled game theory-based multi-document summarization framework integrated with Wikipedia ontology is proposed. The framework exploits the submodularity hidden in underlying ontology and is optimized using the proposed improved algorithm, to enhance the summarization performance. Results of the proposed approach were evaluated with the ROUGE evaluation metric for different multi-document summarization tasks against human-generated summaries and it outperformed DUC, TAC competitors, and other competitive methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Marr, B.: Big data: 20 mind-boggling facts everyone must read. https://www.forbes.com/sites/bernardmarr/2015/09/30/big-data-20-mindboggling-facts-everyone-must-read (2015)

  2. Wang, D.; Li, T.; Zhu, S.; Ding, C.: Multi-document summarization via sentence-level semantic analysis and symmetric factorization. In: Proceedings of SIGIR, pp. 307–314 (2008)

  3. Wang, S.; Li, W.; Deng, H.: Document update summarization using incremental hierarchical clustering. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 279–288 (2010)

  4. Radev, D.R.; Hovy, E.; McKeown, K.: Introduction to the special issue on summarization. Comput. Linguist. 28(4), 399408 (2002)

    Article  Google Scholar 

  5. Inderjeet, M.: Summarization evaluation: an overview. In: Proceedings of the NTCIR Workshop (2001)

  6. He, R.; Qjn, B.; Lin, T.: A novel approach to update summarization using evolutionary manifold-ranking and spectral clustering. Expert Syst. Appl. 39, 2375–2384 (2012)

    Article  Google Scholar 

  7. Nenkova, A.; Passonneau, R.; Mckeown, K.: The pyramid method: incorporating human content selection variation in summarization evaluation. ACM Trans. Lang. Speech Process. 4(2), 1–23 (2007)

    Article  Google Scholar 

  8. Goldstein, J.; Mittal, V.; Carbonell, J.; Kantrowitz, M.: Multi-document summarization by sentence extraction. In: Proceedings of the 2000 NAACL-ANLPWorkshop on Automatic Summarization, pp. 40–48 (2000)

  9. Wang, D.; Zhu, S.; Li, T.; Chi, Y.; Gong, Y.: Integrating document clustering and multi-document summarization. ACM Trans. Knowl. Discov. Data 5, 14:1–14:26 (2011)

    Article  Google Scholar 

  10. Radev, D.R.: Lexrank: Graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22, 457–479 (2004)

    Article  Google Scholar 

  11. Rumeng, L.; Hiroyuki, S.: A hierarchical tree model for update summarization. In: European Conference on Information Retrieval, pp. 660–665 (2015)

  12. Sánchez, D.; Batet, M.; Valls, A.; Gibert, K.: Ontology-driven web-based semantic similarity. J. Intell. Inf. Syst. 35(3), 383–413 (2010)

    Article  Google Scholar 

  13. Shapley, L.S.: A value for n-person games. In: Kuhn, H.W., Tucker, A.W. (eds.) Contributions to the Theory of Games, vol. I. Princeton University Press, Princeton (1950)

    Google Scholar 

  14. Shen, D.; Sun, J.T.; Li, H.; Yang, Q.; Chen, Z.: Document summarization using conditional random fields. In: Proceedings of IJCAI (2007)

  15. Sidney, C.: The Art of Legging. Maxline International, London (1976)

    Google Scholar 

  16. Takeda, H.: Ontology extraction by collaborative tagging. In: WWW 2009. ACM Press (2009)

  17. Town, C.: Ontological inference for image and video analysis. Mach. Vis. Appl. 17, 94–115 (2006)

    Article  Google Scholar 

  18. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents. J. Am. Soc. Inf. Sci. Technol. 59(6), 887–902 (2008)

    Article  Google Scholar 

  19. Wang, D.; Li, T.: Weighted consensus multi-document summarization. Inf. Process. Manag. 48, 513–523 (2012)

    Article  Google Scholar 

  20. Wang, D.; Zhu, S.; Gong, L.Y.: Multi-document summarization using sentence-based topic models. In: Proceedings of ACL-IJCNLP, Association for Computational Linguistics, pp. 297–300 (2009)

  21. Wang, S.; Li, W.; Deng, H.: A survey on automatic summarization. In: International Forum on Information Technology and Applications (2010)

  22. Wei, F.; Li, W.; Lu, Q.; He, Y.: Query-sensitive mutual reinforcement chain and its application in query-oriented multi-document summarization. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, p. 283290 (2008)

  23. Liu, W.; Luo, X.; Zhang, J.; Xue, R.; Xu, R.Y.D.: Semantic summary automatic generation in news event. Concurr. Comput. Pract. Exp. 29(24), e4287 (2017)

    Article  Google Scholar 

  24. Li, Xuan; Liang, D.; Yi-Dong, S.: Update summarization via graph-based sentence ranking. IEEE Trans. Knowl. Data Eng. 25, 1162–1174 (2013)

    Article  Google Scholar 

  25. Yang, Z.; Cai, K.; Tang, J.; Zhang, L.; Su, Z.; Li, J.: Social context summarization. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval SIGIR’11, pp. 255–284 (2011)

  26. Conroy, J.M.; Schlesinger, J.D.; Goldstein, J.; O’Leary, D.P.: Left-brain/right-brain multi-document summarization. In: DUC 2004 Conference Proceedings (2004)

  27. Luo, W.; Zhuang, F.; Shi, Z.: Exploiting relevance, coverage and novelty for query-focussed multi-document summarization. Knowl. Based Syst. 46(1), 33–42 (2013)

    Article  Google Scholar 

  28. Narayanam, R.; Narahari, Y.: A shapley value-based approach to discover influential nodes in social networks. IEEE Trans. Autom. Sci. Eng. 8(1), 130–147 (2011)

    Article  Google Scholar 

  29. Myerson, R.B.: Game Theory: Analysis of Conflict. Harvard University Press, Cambridge (1997)

    MATH  Google Scholar 

  30. Li, T.; Zhu, S.; Ogihara, M.: Hierarchical document classification using automatically generated hierarchy. J. Intell. Inf. Syst. 29(2), 211–230 (2007)

    Article  Google Scholar 

  31. Radev, D.R.; Jing, H.; Styś, M.; Tam, D.: Centroid-based summarization of multiple documents. Inf. Process. Manag. 40(6), 919–938 (2004)

    Article  MATH  Google Scholar 

  32. Ferreira, R.; de Souza Cabral, L.; Lins, R.D.; Silva, G.P.; Freitas, F.; Cavalcanti, G.D.C.; Lima, R.; Simske, S.J.; Favaro, L.: Assessing sentence scoring techniques for extractive text summarization. Expert Syst. Appl. 40, 5755–5764 (2013)

    Article  Google Scholar 

  33. Canhasi, E.; Kononeko, I.: Weighted archetypal analysis of the multi-document summarization. Expert Syst. Appl. 41(21), 535–543 (2014)

    Article  Google Scholar 

  34. Alguliev, R.M.; Aliguliyev, R.M.; Isazade, N.R.: Multiple documents summarization based on evolutionary optimization algorithm. Expert Syst. Appl. 40, 16751689 (2013)

    Google Scholar 

  35. Celikyilmaz, A.; Hakkani-Tur, D.; Tur, G.; Fidler, A.; Hillard, D.: Exploiting distance based similarity in topic models for user intent detection. In: IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (2011)

  36. Chong, L.; Min-Lie, H.; Xiao-Yan, Z.; Li, Ming: A new approach for multi-document update summarization. J. Comput. Sci. Technol. 25, 739–749 (2010)

    Article  MathSciNet  Google Scholar 

  37. Lin, H.; Bilmes, J.: A class of submodular functions for document summarization. In: The Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, pp. 510–520 (1998)

  38. Kogilavani, A.; Balasubramanie, B.: Ontology enhanced clustering based summarization of medical documents. Int. J. Recent Trends Eng. 1, 546549 (2009)

    Google Scholar 

  39. Nastase, V.: Topic-driven multi-document summarization with encyclopedic knowledge and spreading activation. In: EMNLP. ACL, p. 763772 (2008)

  40. Garcia, L.F.F.; Valdeni de Lima, J.; Loh, S.; Palazzo Moreira de Oliveira, J.: Using ontological modeling in a context-aware summarization system to adapt text for mobile devices. In: Chen, P.P., Wong, L.Y. (eds.) Active Conceptual Modeling of Learning. Lecture Notes in Computer Science, Vol. 4512, p. 144154. Springer (2006)

  41. Hamasaki, M.; Matsuo, Y.; Nishimua, T.; Takeda, H.: Ontology extraction by collaborative tagging. In: WWW 2009 ACM Press (2009)

  42. Pourvali, M.; Abadeh, M.S.: Automated text summarization base on lexicales chain and graph using of wordnet and wikipedia knowledge base. CoRR, abs/1203.3586 (2012)

  43. Filatova, E.: A formal model for information selection in multi-sentence text extraction. In: Proceedings of the International Conference on Computational Linguistics, COLING, pp. 397403 (2004)

  44. Baralas, E.; Cagliero, L.; Jabeen, S.; Fiori, A.; Shah, S.: Multi-document summarization based on the Yago ontology. Expert Syst. Appl. 40, 6976–6984 (2013)

    Article  Google Scholar 

  45. Khuller, S.; Moss, A.; Naor, S.: The budgeted maximum coverage problem. Inf. Process. Lett. 70(1), 39–45 (1999)

    Article  MathSciNet  MATH  Google Scholar 

  46. Lin, D.: An information-theoretic definition of similarity. In: Proceedings of ICML, Vol. 296304, p. 296304 (1998)

  47. John, A.; Premjith, P.S.; Wilscy, M.: Extractive multi-document summarization using population-based multicriteria optimization. Expert Syst. Appl. 86, 385–397 (2017)

    Article  Google Scholar 

  48. Dang, H.T.; Owczarzak, K.: Overview of the TAC 2008 update summarization task. In: Proceedings of Text Analysis Conference (2008)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Amreen Ahmad.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ahmad, A., Ahmad, T. A Game Theory Approach for Multi-document Summarization. Arab J Sci Eng 44, 3655–3667 (2019). https://doi.org/10.1007/s13369-018-3619-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13369-018-3619-y

Keywords

Navigation