In order to solve the problems of low power flow calculation accuracy and voltage overcrossing of the distribution network caused by large-scale distributed power supply, a data-driven power flow analysis and volt/var optimization control strategy for distribution network are proposed in this paper. Firstly, a data-driven power flow analysis model for the distribution network is proposed, and the nonlinear mapping relationship between the distribution network state and power flow results is described from the data-driven perspective. Secondly, the paper analyzes the influence of photovoltaic power supply on the voltage of the distribution network, and establishes the volt/var optimization model based on photovoltaic power supply. Then, on the basis of data-driven power flow analysis of the distribution network, the distribution network reactive voltage optimization strategy based on data-driven power flow analysis is proposed to ensure the safe and stable operation of distribution network voltage, reduce the operating network loss of distribution network, and realize the distribution network reactive voltage optimization is not affected by factors such as distribution network line parameters. Finally, IEEE 33 node power distribution system is used to verify the effectiveness of the proposed strategy.