Twitter is increasingly being used during disasters to communicate with authorities, ascertain the ground reality, and coordinate real-time rescue and recovery activities. Geographical location information about users and events is critical in these scenarios. Geotagged tweets are extremely infrequent, and other location fields, such as user location and place name, are unreliable. The extraction of geographical information from tweet text is limited by the fact that individuals frequently publish multilingual tweets that contain numerous grammatical and spelling errors, as well as nonstandard acronyms. As a result, determining the geographical location of the tweet is a challenging problem. This article presents a technique based on deep neural networks for extracting geographical references mentioned in bilingual tweets. Several deep learning-based models, including convolutional neural networks (CNNs), long short-term memory (LSTM), bidirectional LSTM (Bi-LSTM), and attention-based Bi-LSTM, are implemented on real-world English and Hindi language tweets to determine their suitability for extracting location references. The proposed CNN, along with a conditional random field at the last layer, is found to perform better than other models, with an <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$F_1$</tex-math> </inline-formula> -score of 0.858. The findings of this study can aid in early event detection, pinpointing the area of devastation and victims, real-time traffic management, and a number of other location-based applications. The suggested system’s code and trained model may be obtained at https://github.com/Abhinavkmr/Bi-lingual-location-reference-identification.git.