Outsourcing decision tree inference services to the cloud is highly beneficial, yet raises critical privacy concerns on the proprietary decision tree of the model provider and the private input data of the client. In this paper, we design, implement, and evaluate a new system that allows highly efficient outsourcing of decision tree inference. Our system significantly improves upon prior art in the overall online end-to-end secure inference service latency at the cloud as well as the local-side performance of the model provider. We first present a new scheme which securely shifts most of the processing of the model provider to the cloud, resulting in a substantial reduction on the model provider's performance complexities. We further devise a scheme which substantially optimizes the performance for secure decision tree inference at the cloud, particularly the communication round complexities. The synergy of these techniques allows our new system to achieve up to <inline-formula><tex-math notation="LaTeX">$8 \times$</tex-math></inline-formula> better overall online end-to-end secure inference latency at the cloud side over realistic WAN environment, as well as bring the model provider up to <inline-formula><tex-math notation="LaTeX">$19 \times$</tex-math></inline-formula> savings in communication and <inline-formula><tex-math notation="LaTeX">$18 \times$</tex-math></inline-formula> savings in computation.