Abstract

Federal government agencies and organizations doing business with them have to adhere to the Code of Federal Regulations (CFR). The CFRs are currently available as large text documents that are not machine processable and so require extensive manual effort to parse and comprehend, especially when sections cross-reference topics spread across various titles. We have developed a novel framework to automatically extract knowledge from CFRs and represent it using a semantically rich knowledge graph. The framework captures knowledge in the form of key terms, rules, topic summaries, relationships between various terms, semantically similar terminologies, deontic expressions, and cross-referenced facts and rules. We built our framework using deep learning technologies like TensorFlow for word embeddings and text summarization, Gensim for topic modeling, and Semantic Web technologies for building the knowledge graph. In this article, we describe our framework in detail and present the results of our analysis of the Title 48 CFR knowledge base that we have built using this framework. Our framework and knowledge graph can be adopted by federal agencies and businesses to automate their internal processes that reference the CFR rules and policies.

Highlights

  • As core documents of the Executive Branch of the U.S government, the Code of Federal Regulations (CFR) [25] provides the public with a comprehensive publication vehicle for all of the regulations issued by federal agencies and the president; the documents are indispensable to the government’s operations and communication [2]

  • Since each topic has a large number of rules and policies associated with it, the CFR titles are further organized into chapters, with every title having an average of 50 chapters

  • As CFR titles are much longer and complex documents than Service Level Agreements (SLAs) or privacy policy documents, we significantly improved and refined our previous approach for developing this framework to capture various facts and rules spread across the documents

Read more

Summary

Introduction

As core documents of the Executive Branch of the U.S government, the Code of Federal Regulations (CFR) [25] provides the public with a comprehensive publication vehicle for all of the regulations issued by federal agencies and the president; the documents are indispensable to the government’s operations and communication [2]. These regulations are published in the Federal Register and must be adhered to by every organization that wants to do business with the U.S government.

Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.