This paper studies the optimization of task assignment and pickup and delivery operations using a heterogeneous fleet of unmanned aerial vehicles (UAVs). We specifically address the distribution of emergency medical supplies, including medications, vaccines, and essential medical aid, as well as the collection of biological blood samples for testing and analysis. Unique challenges, such as supply shortages, time windows, and geographical considerations, are explicitly taken into account. The problem is first formulated as a mixed-integer linear programming model aimed at maximizing the total profit derived from the execution of a set of emergency healthcare pickup and delivery tasks. An enhanced Q-learning-based adaptive large neighborhood search (QALNS) is proposed for large-scale benchmark instances. QALNS exhibits a superior performance on benchmark instances. It also improves the quality of the solutions on average by 5.49% and 6.86% compared to the Gurobi solver and a state-of-the-art adaptive large neighborhood search algorithm, respectively. Sensitivity analyses are performed on critical factors contributing to the performance of the QALNS algorithm, such as the learning rate and the discount indicator. Finally, we provide managerial insights on the use of the fleet of UAVs and the design of the network.