We derive the likelihood functions and the maximum likelihood (ML) detectors for four classes of single-input double-output (SIDO) communication systems, i.e., systems with one transmit and two receive antennas. For all classes, the received signals are contaminated by a Gaussian noise component and a non-Gaussian component induced by the Gaussian transmissions of a proactive continuous single-antenna jammer over an unknown complex <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$2\times 1$</tex-math></inline-formula> Gaussian vector channel. The considered classes correspond to whether full channel distribution information (CDI), or partial CDI about the transmitter channel and the jammer channel is available at the receiver. Unlike their scalar counterparts, the vector channels considered herein interweave the components of the received signal, rendering the derivation of the likelihood function a daunting task for more than two receive antennas. Furthermore, the interweaving of the received signal components in the vector channel case prevents the optimal ML detector for unit-norm constellations from reducing to the corresponding Gaussian approximation-based detector. This is in sharp contrast with the scalar case, wherein the two detectors are equivalent for unit-norm constellations. Confirming our analytical findings, experimental results show that the difference between the two detectors can be significant, especially when the transmitter-receiver and jammer-receiver channels have substantial line-of-sight components. Although the computational cost of performing optimal ML detection in the presence of non-Gaussian jamming is higher in the case of two receive antennas, the performance advantage over the single antenna case justifies it.