EEG-enabled earbuds represent a promising frontier in brain activity monitoring beyond traditional laboratory testing. Their discrete form factor and proximity to the brain make them the ideal candidate for the first generation of discrete non-invasive brain-computer interfaces (BCIs). However, this new technology will require comprehensive characterization before we see widespread consumer and health-related usage. To address this need, we developed a validation toolkit that aims to facilitate and expand the assessment of ear-EEG devices. The first component of this toolkit is a desktop application ("EaR-P Lab") that controls several EEG validation paradigms. This application uses the Lab Streaming Layer (LSL) protocol, making it compatible with most current EEG systems. The second element of the toolkit introduces an adaptation of the phantom evaluation concept to the domain of ear-EEGs. Specifically, it utilizes 3D scans of the test subjects' ears to simulate typical EEG activity around and inside the ear, allowing for controlled assessment of different ear-EEG form factors and sensor configurations. Each of the EEG paradigms were validated using wet-electrode ear-EEG recordings and benchmarked against scalp-EEG measurements. The ear-EEG phantom was successful in acquiring performance metrics for hardware characterization, revealing differences in performance based on electrode location. This information was leveraged to optimize the electrode reference configuration, resulting in increased auditory steady-state response (ASSR) power. Through this work, an ear-EEG evaluation toolkit is made available with the intention to facilitate the systematic assessment of novel ear-EEG devices from hardware to neural signal acquisition.