The task of anomaly detection in data patterns remains challenging due to the diverse of fault patterns, which often render existing models ineffective when encountered with unknown or unseen data patterns in the training data. Furthermore, insufficient samples, the presence of chaotic behavior, unbalanced anomaly patterns are potential limiting factors in the practical application. While prediction error-based approaches attempt to address these challenges, the discriminability of abnormal patterns in existing scenarios is not yet sufficient, thus limiting the ability to identify abnormal phenomena. To overcome these limitations, we proposed a novel model called MCGnet. The backbone architecture of MCGnet employed a novel design of continuous multi-layer circular convolution transformations. This design enhances the expressive capability of the model by expanding the differences among data patterns. Additionally, we introduced in parallel the newly proposed temporal-frequency gated Gaussian unit, which generates richer feature representations. Subsequently, the obtained feature representations were fed into an explicit causal network to learn complex patterns of data along temporal dependencies. The MCGnet not only solves high-precision time series prediction problems but also significantly improves the discriminability between normal and abnormal patterns. Since the proposed model is accurately modeled using only a small amount of normal data, it bypasses the limiting factors such as insufficient fault samples and sample imbalance, and is able to effectively deal with various unknown data patterns. The accuracy of the proposed method was verified through simulation data (Duffing system) and its potential for engineering applications was demonstrated through real-world data (rotor system).