Every day, people are using search engines for different purposes such as research, shopping, or entertainment. Among the behaviors of search engine users, we are particularly interested in search-and-go behavior, which intuitively corresponds to a simple but challenging question, i.e., will users go where they search? Accurately estimating such behavior can be of great importance for Internet companies to recommend point-of-interest (POI), advertisement, and route, as well as for governments and public service operators like metro companies to conduct traffic monitoring, crowd management, and transportation scheduling. Therefore, in this study, we first collect search log data and GPS log data with linked and consistent user ID from Yahoo! Japan portal application installed in millions of smart-phones and tablets. Then we propose a framework including a complete data-processing procedure and an end-to-end deep learning model to predict whether a user will check-in the searched place or not. Specifically, as users’ daily activities are considered to have high correlation with their travel, eating, and recreation decision in the future (i.e., go or not), Deep Spatial–Temporal Interaction Network (DeepSTIN) is elaborately designed to automatically learn the sophisticated spatiotemporal interactions between mobility data and search query data. Experimental results based on the standard metrics demonstrate that our proposed framework can achieve satisfactory performances on multiple real-world search scenarios.