In the realm of Wireless Sensor Networks (WSNs), the quest for enhanced energy efficiency remains paramount, given their pivotal role in facilitating the Internet of Things (IoT) applications. Existing methodologies often grapple with the dual challenge of optimizing energy consumption while ensuring robust data transmission, thereby limiting their efficacy in dynamic, resource-constrained environments. This work introduces a novel paradigm that meticulously addresses these constraints, thereby heralding a significant leap towards optimizing WSNs' operational efficiency. At the core of our proposed model lies the integration of Fuzzy Analytic Hierarchy Process (Fuzzy AHP)-based clustering, which ingeniously segregates nodes by leveraging multi-criteria such as location, energy efficiency, and temporal performance. This clustering serves as a precursor to the application of an innovative Iterative Grey Wolf Jelly Fish Optimizer (GWJFO). The GWJFO stands out by its strategic prowess in delineating optimal routing paths, thus minimizing energy expenditure and enhancing data relay efficiency. Furthermore, our model is fortified with a Q Learning Method, ingeniously designed to identify and execute optimal alternate routing paths through a Make Before Break strategy. This addition not only mitigates potential faults but also significantly boosts computational efficiency and packet delivery performance. Empirical validation through real-time network simulations underscored the model's superiority, demonstrating a 9.5% improvement in communication speed, an 8.5% increase in energy efficiency, a 4.5% improvement in packet delivery performance under fault conditions, a 10.4% rise in throughput, and a 5.9% enhancement in network consistency over existing benchmarks. This groundbreaking work not only paves the way for more energy-efficient WSNs but also sets a new standard in achieving high-performance metrics essential for the next generation of IoT applications. The implications of these advancements extend beyond mere technical enhancements, offering a beacon for future research in the domain, potentially revolutionizing the way we deploy and manage sensor networks in an array of applications.