We study mathematical formulations for batch-processing machine scheduling problems (BPMPs), which are the challenging issues in the machine scheduling literature where machines are capable of processing a batch of jobs simultaneously if jobs with non-identical sizes can be packed in a capacitated machine. In this paper, we tackle single- and parallel-machine BPMPs, and other interesting problem variants that aim at minimizing the makespan. We develop novel formulations along with valid inequalities and an algorithm framework that makes use of dual information and bounding techniques to achieve efficiency when instances are intractable. Extensive computational experiments on benchmark instances show that our approaches achieve state-of-the-art results and prove the optimality of intractable instances in the literature.