By developing IoTs (Internet of Things), a new concept is arising in WBANs (Wireless Body Area Networks), called IoBs (Internet of Bodies). WBANs are the safety-critical devices that use rule-based knowledge with fuzzy variables for monitoring patients. There are two issues with the knowledgebase: (1) appropriate knowledge representation and inference and (2) handling the situations for which there is no rule in the knowledgebase. In this paper, for the first issue we present MFCPN (Multi-level Fuzzy Colored Petri-nets) and for the second one we present a RL (Reinforcement Learning) mechanism for decision making based on the past device’s feedback to the environment. Through a few scenarios and two datasets, the proposed approach has been applied to pacemakers to show the approach's effectiveness. The comparison between decision making by our approach and two real datasets indicates the proposed approach has 79.3 % accuracy.