Wireless body area network (WBAN) is an internet-of-things technology that facilitates remote patient monitoring and enables medical staff to administer timely treatments. One of the main challenges in designing WBANs is the routing problem, which is complicated due to dynamic changes in network topology and the limited resources of nodes. Several heuristic and metaheuristic methods have been presented to solve the routing problem in WBANs. Although metaheuristics outperform heuristics by producing higher-quality solutions, they cannot respond to real-time requests. This paper introduces a reactive routing protocol for WBANs that combines a fuzzy heuristic with a metaheuristic learning model. It utilizes a Takagi-Sugeno Fuzzy Inference System in conjunction with the Grey Wolf Optimizer (named TSFIS-GWO). The objective is to simultaneously benefit from the advantages of both approaches, namely, the effectiveness of metaheuristics for offline hyperparameter tuning and the quickness of fuzzy heuristics for real-time routing. At every round, the tuned fuzzy system takes multiple parameters of the current state of the nodes and links to construct the multi-hop routing tree under IEEE 802.15.6. To optimize the performance of the protocol for each WBAN, the fuzzy rules of the TSFIS model are automatically adjusted through a learning method based on GWO. This is done in accordance with the specific requirements of the application, and the tuning process takes place once before the protocol is applied. Simulation results in three applications demonstrate that the proposed TSFIS-GWO model is capable of providing real-time solutions while outperforming the existing methods in terms of application-specific performance measures.