Abstract
In today’s era, information has been growing exponentially on the web, due to which extraction of relevant and concise information has become a challenging task. To overcome the above problem, a fundamental tool known as summarization techniques has been used for understanding and organizing such large datasets. Recently, researchers have been devoting a lot of effort to develop semantics-based models, so as to improve summarization performance. In this paper, a versatile and principled game theory-based multi-document summarization framework integrated with Wikipedia ontology is proposed. The framework exploits the submodularity hidden in underlying ontology and is optimized using the proposed improved algorithm, to enhance the summarization performance. Results of the proposed approach were evaluated with the ROUGE evaluation metric for different multi-document summarization tasks against human-generated summaries and it outperformed DUC, TAC competitors, and other competitive methods.
Similar content being viewed by others
References
Marr, B.: Big data: 20 mind-boggling facts everyone must read. https://www.forbes.com/sites/bernardmarr/2015/09/30/big-data-20-mindboggling-facts-everyone-must-read (2015)
Wang, D.; Li, T.; Zhu, S.; Ding, C.: Multi-document summarization via sentence-level semantic analysis and symmetric factorization. In: Proceedings of SIGIR, pp. 307–314 (2008)
Wang, S.; Li, W.; Deng, H.: Document update summarization using incremental hierarchical clustering. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 279–288 (2010)
Radev, D.R.; Hovy, E.; McKeown, K.: Introduction to the special issue on summarization. Comput. Linguist. 28(4), 399408 (2002)
Inderjeet, M.: Summarization evaluation: an overview. In: Proceedings of the NTCIR Workshop (2001)
He, R.; Qjn, B.; Lin, T.: A novel approach to update summarization using evolutionary manifold-ranking and spectral clustering. Expert Syst. Appl. 39, 2375–2384 (2012)
Nenkova, A.; Passonneau, R.; Mckeown, K.: The pyramid method: incorporating human content selection variation in summarization evaluation. ACM Trans. Lang. Speech Process. 4(2), 1–23 (2007)
Goldstein, J.; Mittal, V.; Carbonell, J.; Kantrowitz, M.: Multi-document summarization by sentence extraction. In: Proceedings of the 2000 NAACL-ANLPWorkshop on Automatic Summarization, pp. 40–48 (2000)
Wang, D.; Zhu, S.; Li, T.; Chi, Y.; Gong, Y.: Integrating document clustering and multi-document summarization. ACM Trans. Knowl. Discov. Data 5, 14:1–14:26 (2011)
Radev, D.R.: Lexrank: Graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22, 457–479 (2004)
Rumeng, L.; Hiroyuki, S.: A hierarchical tree model for update summarization. In: European Conference on Information Retrieval, pp. 660–665 (2015)
Sánchez, D.; Batet, M.; Valls, A.; Gibert, K.: Ontology-driven web-based semantic similarity. J. Intell. Inf. Syst. 35(3), 383–413 (2010)
Shapley, L.S.: A value for n-person games. In: Kuhn, H.W., Tucker, A.W. (eds.) Contributions to the Theory of Games, vol. I. Princeton University Press, Princeton (1950)
Shen, D.; Sun, J.T.; Li, H.; Yang, Q.; Chen, Z.: Document summarization using conditional random fields. In: Proceedings of IJCAI (2007)
Sidney, C.: The Art of Legging. Maxline International, London (1976)
Takeda, H.: Ontology extraction by collaborative tagging. In: WWW 2009. ACM Press (2009)
Town, C.: Ontological inference for image and video analysis. Mach. Vis. Appl. 17, 94–115 (2006)
Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents. J. Am. Soc. Inf. Sci. Technol. 59(6), 887–902 (2008)
Wang, D.; Li, T.: Weighted consensus multi-document summarization. Inf. Process. Manag. 48, 513–523 (2012)
Wang, D.; Zhu, S.; Gong, L.Y.: Multi-document summarization using sentence-based topic models. In: Proceedings of ACL-IJCNLP, Association for Computational Linguistics, pp. 297–300 (2009)
Wang, S.; Li, W.; Deng, H.: A survey on automatic summarization. In: International Forum on Information Technology and Applications (2010)
Wei, F.; Li, W.; Lu, Q.; He, Y.: Query-sensitive mutual reinforcement chain and its application in query-oriented multi-document summarization. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, p. 283290 (2008)
Liu, W.; Luo, X.; Zhang, J.; Xue, R.; Xu, R.Y.D.: Semantic summary automatic generation in news event. Concurr. Comput. Pract. Exp. 29(24), e4287 (2017)
Li, Xuan; Liang, D.; Yi-Dong, S.: Update summarization via graph-based sentence ranking. IEEE Trans. Knowl. Data Eng. 25, 1162–1174 (2013)
Yang, Z.; Cai, K.; Tang, J.; Zhang, L.; Su, Z.; Li, J.: Social context summarization. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval SIGIR’11, pp. 255–284 (2011)
Conroy, J.M.; Schlesinger, J.D.; Goldstein, J.; O’Leary, D.P.: Left-brain/right-brain multi-document summarization. In: DUC 2004 Conference Proceedings (2004)
Luo, W.; Zhuang, F.; Shi, Z.: Exploiting relevance, coverage and novelty for query-focussed multi-document summarization. Knowl. Based Syst. 46(1), 33–42 (2013)
Narayanam, R.; Narahari, Y.: A shapley value-based approach to discover influential nodes in social networks. IEEE Trans. Autom. Sci. Eng. 8(1), 130–147 (2011)
Myerson, R.B.: Game Theory: Analysis of Conflict. Harvard University Press, Cambridge (1997)
Li, T.; Zhu, S.; Ogihara, M.: Hierarchical document classification using automatically generated hierarchy. J. Intell. Inf. Syst. 29(2), 211–230 (2007)
Radev, D.R.; Jing, H.; Styś, M.; Tam, D.: Centroid-based summarization of multiple documents. Inf. Process. Manag. 40(6), 919–938 (2004)
Ferreira, R.; de Souza Cabral, L.; Lins, R.D.; Silva, G.P.; Freitas, F.; Cavalcanti, G.D.C.; Lima, R.; Simske, S.J.; Favaro, L.: Assessing sentence scoring techniques for extractive text summarization. Expert Syst. Appl. 40, 5755–5764 (2013)
Canhasi, E.; Kononeko, I.: Weighted archetypal analysis of the multi-document summarization. Expert Syst. Appl. 41(21), 535–543 (2014)
Alguliev, R.M.; Aliguliyev, R.M.; Isazade, N.R.: Multiple documents summarization based on evolutionary optimization algorithm. Expert Syst. Appl. 40, 16751689 (2013)
Celikyilmaz, A.; Hakkani-Tur, D.; Tur, G.; Fidler, A.; Hillard, D.: Exploiting distance based similarity in topic models for user intent detection. In: IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (2011)
Chong, L.; Min-Lie, H.; Xiao-Yan, Z.; Li, Ming: A new approach for multi-document update summarization. J. Comput. Sci. Technol. 25, 739–749 (2010)
Lin, H.; Bilmes, J.: A class of submodular functions for document summarization. In: The Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, pp. 510–520 (1998)
Kogilavani, A.; Balasubramanie, B.: Ontology enhanced clustering based summarization of medical documents. Int. J. Recent Trends Eng. 1, 546549 (2009)
Nastase, V.: Topic-driven multi-document summarization with encyclopedic knowledge and spreading activation. In: EMNLP. ACL, p. 763772 (2008)
Garcia, L.F.F.; Valdeni de Lima, J.; Loh, S.; Palazzo Moreira de Oliveira, J.: Using ontological modeling in a context-aware summarization system to adapt text for mobile devices. In: Chen, P.P., Wong, L.Y. (eds.) Active Conceptual Modeling of Learning. Lecture Notes in Computer Science, Vol. 4512, p. 144154. Springer (2006)
Hamasaki, M.; Matsuo, Y.; Nishimua, T.; Takeda, H.: Ontology extraction by collaborative tagging. In: WWW 2009 ACM Press (2009)
Pourvali, M.; Abadeh, M.S.: Automated text summarization base on lexicales chain and graph using of wordnet and wikipedia knowledge base. CoRR, abs/1203.3586 (2012)
Filatova, E.: A formal model for information selection in multi-sentence text extraction. In: Proceedings of the International Conference on Computational Linguistics, COLING, pp. 397403 (2004)
Baralas, E.; Cagliero, L.; Jabeen, S.; Fiori, A.; Shah, S.: Multi-document summarization based on the Yago ontology. Expert Syst. Appl. 40, 6976–6984 (2013)
Khuller, S.; Moss, A.; Naor, S.: The budgeted maximum coverage problem. Inf. Process. Lett. 70(1), 39–45 (1999)
Lin, D.: An information-theoretic definition of similarity. In: Proceedings of ICML, Vol. 296304, p. 296304 (1998)
John, A.; Premjith, P.S.; Wilscy, M.: Extractive multi-document summarization using population-based multicriteria optimization. Expert Syst. Appl. 86, 385–397 (2017)
Dang, H.T.; Owczarzak, K.: Overview of the TAC 2008 update summarization task. In: Proceedings of Text Analysis Conference (2008)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ahmad, A., Ahmad, T. A Game Theory Approach for Multi-document Summarization. Arab J Sci Eng 44, 3655–3667 (2019). https://doi.org/10.1007/s13369-018-3619-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13369-018-3619-y