Request for quotation (RFQ) is a lengthy document soliciting vendor products and services according to rigid specifications. This research develops an integrated natural language processing (NLP), text mining, and machine learning approach for intelligent RFQ summarization. Over 1,300 power transformer RFQ requests are used to build a word-embedding model for training and testing. Domain keywords are extracted using N-gram TF-IDF. The method automatically extracts essential specifications such as voltage, capacity, and impedance from RFQs using text analytics. The K-means algorithm groups the sentences of each specification. The TextRank algorithm identifies important sentences of all specifications to generate RFQ summaries. The summarization system helps engineers shorten the time to identify all specifications and reduces the risk of missing important requirements during manual RFQ reading. The system helps improve the complex product design for manufacturers and improve the cost estimation and competitiveness of quotations in a highly competitive marketplace.
Read full abstract