The stiffness of soft biological tissues not only depends on the applied deformation, but also on the deformation rate. To model this type of behavior, traditional approaches select a specific time-dependent constitutive model and fit its parameters to experimental data. Instead, a new trend now suggests a machine-learning based approach that simultaneously discovers both the best model and best parameters to explain given data. Recent studies have shown that feed-forward constitutive neural networks can robustly discover constitutive models and parameters for hyperelastic materials. However, feed-forward architectures fail to capture the history dependence of viscoelastic soft tissues. Here we combine a feed-forward constitutive neural network for the hyperelastic response and a recurrent neural network for the viscous response inspired by the theory of quasi-linear viscoelasticity. Our novel rheologically-informed network architecture discovers the time-independent initial stress using the feed-forward network and the time-dependent relaxation using the recurrent network. We train and test our combined network using unconfined compression relaxation experiments of passive skeletal muscle and compare our discovered model to a neo Hookean standard linear solid, to an advanced mechanics-based model, and to a vanilla recurrent neural network with no mechanics knowledge. We demonstrate that, for limited experimental data, our new constitutive recurrent neural network discovers models and parameters that satisfy basic physical principles and generalize well to unseen data. We discover a Mooney–Rivlin type two-term initial stored energy function that is linear in the first invariant I1 and quadratic in the second invariant I2 with stiffness parameters of 0.60 kPa and 0.55 kPa. We also discover a Prony-series type relaxation function with time constants of 0.362s, 2.54s, and 52.0s with coefficients of 0.89, 0.05, and 0.03. Our newly discovered model outperforms both the neo Hookean standard linear solid and the vanilla recurrent neural network in terms of prediction accuracy on unseen data. Our results suggest that constitutive recurrent neural networks can autonomously discover both model and parameters that best explain experimental data of soft viscoelastic tissues. Our source code, data, and examples are available at https://github.com/LivingMatterLab.