Abstract

We address the e-rulemaking problem of reducing the manual labor required to analyze public comment sets. In current and previous work, for example, text categorization techniques have been used to speed up the comment analysis phase of e-rulemaking by classifying sentences automatically according to the rule-specific issues [2] or general topics that they address [7, 8]. Manually annotated data, however, is still required to train the supervised inductive learning algorithms that perform the categorization. This paper therefore investigates the application of active learning methods to public comment categorization: we develop two new, general-purpose active learning techniques that selectively sample from the available training data for human labeling when building the sentence-level classifiers employed in public comment categorization. Using an e-rulemaking corpus developed for this purpose [2], we compare our methods to the well-known query by committee (QBC) active learning algorithm [5] and to a baseline that randomly selects instances for labeling in each round of active learning. We show that our methods outperform both the random-selection active learner and QBC by statistically significant margins, requiring many fewer training examples to reach the same levels of accuracy on a held-out test set. This provides promising evidence that automated text categorization methods might be used effectively to support public comment analysis.
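For readers unfamiliar with the QBC baseline referenced above, the sketch below illustrates one common form of its selection step: train a small committee of classifiers on bootstrap resamples of the labeled data, then request human labels for the pool instances on which the committee's votes disagree most (vote entropy). This is a generic illustration of QBC under assumed details, not the paper's own methods; the committee size, batch size, and logistic regression members are illustrative choices.

    # Minimal sketch of a query-by-committee (QBC) selection round.
    # Assumed details (not from the paper): logistic regression committee
    # members, bootstrap resampling, vote-entropy disagreement, batch of 10.
    import numpy as np
    from scipy.stats import entropy
    from sklearn.linear_model import LogisticRegression

    def qbc_select(X_labeled, y_labeled, X_pool, n_committee=5,
                   batch_size=10, rng=np.random.default_rng(0)):
        """Return pool indices the committee disagrees on most."""
        n = len(y_labeled)
        votes = []
        for _ in range(n_committee):
            # Each member sees a different bootstrap resample, so members
            # differ and can disagree on unlabeled pool instances.
            # (Assumes each resample contains at least two classes.)
            idx = rng.integers(0, n, size=n)
            member = LogisticRegression(max_iter=1000)
            member.fit(X_labeled[idx], y_labeled[idx])
            votes.append(member.predict(X_pool))
        votes = np.array(votes)  # shape: (n_committee, n_pool)
        classes = np.unique(y_labeled)
        # Fraction of committee votes per class for each pool instance;
        # columns sum to 1, so column-wise entropy is the vote entropy.
        vote_fractions = np.stack([(votes == c).mean(axis=0) for c in classes])
        disagreement = entropy(vote_fractions, base=2)
        # Highest-entropy instances are the most contested: label those next.
        return np.argsort(disagreement)[-batch_size:]

In a full active learning loop, the selected instances would be labeled by a human annotator, moved from the pool into the labeled set, and the committee retrained, repeating for a fixed number of rounds or until a labeling budget is exhausted.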
