Laryngoscopy is routinely used for suspicious vocal cord lesions with limited performance. Accumulated studies have demonstrated the bright prospect of deep learning in processing medical imaging. In this study, we perform a systematic review and meta-analysis to investigate diagnostic utility of deep learning in laryngoscopy. The study was performed according to the Primary Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines. We comprehensively retrieved articles from the PubMed, Scopus, Embase, and Web of Science up to July 14, 2024. Eligible studies with application of deep learning algorithm in laryngoscopy were assessed and enrolled by two independent investigators. The pooled sensitivity, specificity, positive likelihood ratio, negative likelihood ratio and diagnostic odds ratio with 95% confidence intervals (CIs) were calculated using a random effects model. We retained 9 eligible studies adding up to 106,175 endoscopic images for the meta-analysis. The pooled sensitivity and specificity to diagnose laryngeal cancer were 0.95(95% CI: 0.85-0.98) and 0.96 (95% CI: 0.91-0.98). The area under the curve of deep learning was 0.99 (95%CI: 0.97-0.99). Deep learning demonstrated excellent diagnostic efficacy in assessing laryngeal cancer with laryngoscope images in current studies, which manifests its potential of aiding endoscopist for laryngeal cancer diagnosis and clinical decision making.