Abstract
This study presents a machine learning approach to predicting a user’s next location of visit from their previous tweets and personality traits. People’s future behavior is reflected in their profile and past behavior, which motivates this work. People now readily share experiences such as trips and visits to restaurants, hotels, or historic places on social media platforms. Twitter, Instagram, and Facebook are among the popular platforms where such information is shared. This study used Twitter data to develop a machine learning (ML) model that predicts a user’s next visiting location. ML enables a computer to learn from historical records and to use that knowledge for prediction and decision making on new data. Prediction accuracy is a key requirement for any prediction model: a model that does not predict accurately cannot be accepted as reliable. With this in mind, this study used an ensemble classification approach (ESA) to develop a prediction model for the problem. ESA trains different classifiers on the same data and uses a voting method to select the most accurate prediction. This is the first study to apply ESA to this type of problem, and the results show that ESA enhances the model’s prediction accuracy, which is highly desirable.
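The voting idea described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the three toy classifiers are stand-ins rather than the classifiers used in the study, the two numeric features and the location labels are hypothetical, and simple majority (hard) voting is assumed because the abstract does not specify the voting rule.

```python
from collections import Counter
from math import dist


class MajorityClass:
    """Baseline: always predicts the most frequent training label."""

    def fit(self, X, y):
        self.label_ = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, x):
        return self.label_


class NearestNeighbour:
    """1-NN: predicts the label of the closest training point."""

    def fit(self, X, y):
        self.X_, self.y_ = list(X), list(y)
        return self

    def predict(self, x):
        i = min(range(len(self.X_)), key=lambda j: dist(x, self.X_[j]))
        return self.y_[i]


class NearestCentroid:
    """Predicts the label whose mean feature vector is closest."""

    def fit(self, X, y):
        groups = {}
        for xi, yi in zip(X, y):
            groups.setdefault(yi, []).append(xi)
        self.centroids_ = {
            label: tuple(sum(col) / len(rows) for col in zip(*rows))
            for label, rows in groups.items()
        }
        return self

    def predict(self, x):
        return min(self.centroids_, key=lambda lab: dist(x, self.centroids_[lab]))


def vote(classifiers, x):
    """Hard (majority) vote; a tie goes to the label that was voted first."""
    votes = [clf.predict(x) for clf in classifiers]
    return Counter(votes).most_common(1)[0][0]


# Hypothetical training data: two made-up per-user feature scores and a
# made-up "next location" label. These are not taken from the study.
X_train = [(0.9, 0.1), (0.8, 0.2), (0.7, 0.3), (0.2, 0.9), (0.1, 0.8)]
y_train = ["restaurant", "restaurant", "restaurant", "museum", "museum"]

# Every classifier is trained on the same data, as the abstract describes.
ensemble = [
    MajorityClass().fit(X_train, y_train),
    NearestNeighbour().fit(X_train, y_train),
    NearestCentroid().fit(X_train, y_train),
]

print(vote(ensemble, (0.15, 0.85)))
print(vote(ensemble, (0.85, 0.15)))
```

For the first query, the baseline votes for the majority label while the two distance-based classifiers outvote it, which shows how an ensemble can correct a weak member.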
Acknowledgements
The authors gratefully acknowledge the financial support of the Ministry of Education and Science of the Russian Federation in the framework of Increase Competitiveness Program of NUST “MISiS” (No. К4-2017-052).
About this article
Cite this article
Kumar, S., Nezhurina, M.I. An ensemble classification approach for prediction of user’s next location based on Twitter data. J Ambient Intell Human Comput 10, 4503–4513 (2019). https://doi.org/10.1007/s12652-018-1134-3
DOI: https://doi.org/10.1007/s12652-018-1134-3