Biofuel has received worldwide attention as one of the most promising renewable energy sources. Particularly, in many countries such as the U.S. and Brazil, first-generation ethanol from corn and sugar cane has been used as automobile fuel after blending with gasoline. Nevertheless, in order to continuously increase the use of biofuels, efforts are needed to reduce the cost of biofuel production and increase its profitability. This can be achieved by increasing the efficiency of a sequential biofuel production process consisting of multiple operations such as feedstock supply, pretreatment, fermentation, distillation, and biofuel transportation. This study aims at investigating methodologies for predicting feedstock yields, which is the earliest step for stable and sustainable biofuel production. Particularly, this study reviews feedstock yield estimation approaches using machine learning technologies that focus on gradually improving estimation accuracy by using big data and computer algorithms from traditional statistical approaches. Given that it is becoming increasingly difficult to stably produce biofuel feedstocks as climate change worsens, research on developing predictive modeling for raw material supply using the latest ML techniques is very important. As a result, this study will help researchers and engineers predict feedstock yields using various machine learning techniques, and contribute to efficient and stable biofuel production and supply chain design based on accurate predictions of feedstocks.