Estimating text regressions using txtreg_train

Carlo Schwarz

doi:10.1177/1536867x231196349

Journal: The Stata Journal: Promoting communications on statistics and Stata
Publication Date: Sep 1, 2023
Citations: 1

Affiliation: Bocconi University

Abstract

In this article, I introduce new commands to estimate text regressions for continuous, binary, and categorical variables based on text strings. The command txtreg_train automatically handles text cleaning, tokenization, model training, and cross-validation for lasso, ridge, elastic-net, and regularized logistic regressions. The txtreg_predict command obtains the predictions from the trained text regression model. Furthermore, the txtreg_analyze command facilitates the analysis of the coefficients of the text regression model. Together, these commands provide a convenient toolbox for researchers to train text regressions. They also allow sharing of pretrained text regression models with other researchers.
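The pipeline named in the abstract (text cleaning, tokenization, model training, and cross-validation) can be illustrated with a minimal sketch. The Python code below is not the txtreg_train implementation and does not reproduce its syntax or options. It assumes a plain bag-of-words representation and fits a lasso by coordinate descent. All function names, the toy documents, and the penalty value are hypothetical and chosen only for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Clean a string (lowercase, drop non-letters) and split it into tokens."""
    return re.sub(r"[^a-z\s]", " ", text.lower()).split()


def build_dtm(docs, vocab=None):
    """Return a document-term count matrix and the vocabulary it uses."""
    counts = [Counter(tokenize(d)) for d in docs]
    if vocab is None:
        vocab = sorted({w for c in counts for w in c})
    return [[float(c[w]) for w in vocab] for c in counts], vocab


def soft_threshold(rho, lam):
    """Shrink rho toward zero by lam; this is what sets lasso coefficients to zero."""
    if rho > lam:
        return rho - lam
    if rho < -lam:
        return rho + lam
    return 0.0


def predict(X, intercept, beta):
    return [intercept + sum(x * b for x, b in zip(row, beta)) for row in X]


def fit_lasso(X, y, lam, n_iter=200):
    """Coordinate descent for (1/2n) * ||y - b0 - X b||^2 + lam * ||b||_1."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    intercept = sum(y) / n
    for _ in range(n_iter):
        for j in range(p):
            z = sum(X[i][j] ** 2 for i in range(n)) / n
            if z == 0:
                continue  # term never appears in the training documents
            # Correlation of feature j with the residual that excludes feature j
            rho = sum(
                X[i][j]
                * (y[i] - intercept
                   - sum(X[i][k] * beta[k] for k in range(p) if k != j))
                for i in range(n)
            ) / n
            beta[j] = soft_threshold(rho, lam) / z
        # The intercept is not penalized: set it to the mean residual
        fitted = predict(X, 0.0, beta)
        intercept = sum(y[i] - fitted[i] for i in range(n)) / n
    return intercept, beta


def cv_mse(docs, y, lam, k=2):
    """k-fold cross-validated mean squared error for one penalty value."""
    n = len(docs)
    total = 0.0
    for fold in range(k):
        train = [i for i in range(n) if i % k != fold]
        test = [i for i in range(n) if i % k == fold]
        X_tr, vocab = build_dtm([docs[i] for i in train])
        b0, b = fit_lasso(X_tr, [y[i] for i in train], lam)
        # Score held-out documents using the training vocabulary only
        X_te, _ = build_dtm([docs[i] for i in test], vocab)
        preds = predict(X_te, b0, b)
        total += sum((p_hat - y[i]) ** 2 for p_hat, i in zip(preds, test))
    return total / n


# Toy data (hypothetical): the outcome rises with "good" and falls with "bad".
docs = ["good good film", "bad film", "good plot", "bad bad plot"]
y = [2.0, -1.0, 1.0, -2.0]

X, vocab = build_dtm(docs)
intercept, beta = fit_lasso(X, y, lam=0.01)
coefs = dict(zip(vocab, beta))  # term -> estimated coefficient
```

In this sketch, inspecting `coefs` plays the role that coefficient analysis plays in the abstract, and comparing `cv_mse(docs, y, lam)` across several values of `lam` is one simple way to pick the penalty. Ridge and elastic-net fits would change only the penalty term.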
