We propose a novel spectrum decision scheme (i.e., channel selection and handoff) for wireless mesh networks (WMN) which use multiple channels and nodes equipped with multi-beam directional antennas. Our scheme has the following features: (i) It performs spectrum decision by considering various WMN parameters, including the channel quality, beam orientation, antenna-caused deafness and capture effects, and application priority level. (ii) It uses the reinforcement learning (RL)-based spectrum decision process to achieve the optimal quality of multimedia transmission in the long term. However, a newly-joined WMN node could take a long time to make a correct spectrum decision due to the difficult choice of initial RL parameters. Therefore, our scheme uses the apprenticeship learning in conjunction with the RL model, to speed up the spectrum decision process by choosing a suitable neighboring node (called “expert”) to teach a newly-joined node (called “apprentice”). Our experiments demonstrate that the proposed spectrum decision scheme improves the network performance and multimedia transmission quality.