Documentation Data Set Research Articles

In previous work, a parsimonious topic model (PTM) was proposed for text corpora. Unlike latent Dirichlet allocation (LDA), PTM determines a subset of salient words for each topic, each with a topic-specific probability, while the remaining words in the dictionary are explained by a universal shared model. Further, in LDA all topics are in principle present in every document. In contrast, PTM gives a sparse topic representation, determining the (small) subset of relevant topics for each document. A customized Bayesian information criterion (BIC) was derived, balancing model complexity and goodness of fit. The BIC is minimized to jointly determine the entire model (the topic-specific words, the document-specific topics, all model parameter values, and the total number of topics) in a wholly unsupervised fashion. In the present work, several important modeling and algorithm (parameter learning) extensions of PTM are proposed. First, we modify the BIC objective function using a lossless coding scheme with low modeling cost for describing words that are non-salient for all topics; such words are essentially identified as wholly noisy/uninformative. This approach increases PTM’s model sparsity, which also allows the selection of models with more topics at lower BIC cost than the original PTM. Second, in the original PTM learning strategy, word switches were updated sequentially, which is myopic and susceptible to finding poor locally optimal solutions. Here, instead, we jointly optimize all the switches that correspond to the same word (across topics). This approach optimizes many more parameters at each step than the original PTM and should therefore, in principle, be less susceptible to finding poor local minima. Results on several document data sets show that our proposed method outperformed the original PTM with respect to multiple performance measures and gave a sparser topic model representation.
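The trade-off between model complexity and goodness of fit described above can be illustrated with the generic textbook BIC. The sketch below is not the customized criterion derived for PTM: the `bic` helper, the candidate log-likelihoods, the parameter counts, and the sample size are all hypothetical values chosen only to show how minimizing the criterion can select the number of topics.

```python
import math


def bic(log_likelihood: float, num_params: int, num_samples: int) -> float:
    """Generic textbook BIC (lower is better).

    The first term penalizes model complexity; the second rewards fit.
    This is NOT the customized PTM criterion from the abstract.
    """
    return num_params * math.log(num_samples) - 2.0 * log_likelihood


# Hypothetical candidates: (number of topics, fitted log-likelihood, free parameters).
candidates = [
    (5, -14000.0, 400),
    (10, -12000.0, 800),
    (20, -11800.0, 1600),
]
num_samples = 2000  # assumed sample size for the penalty term

# Score every candidate and keep the topic count with the smallest BIC.
scores = {k: bic(ll, p, num_samples) for k, ll, p in candidates}
best_num_topics = min(scores, key=scores.get)

for k, score in scores.items():
    print(f"topics={k:>2}  BIC={score:.1f}")
print("selected number of topics:", best_num_topics)
```

With these made-up numbers, the 20-topic candidate fits slightly better than the 10-topic one but pays a much larger complexity penalty, so the 10-topic candidate has the lowest score.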

Background: Integrative medicine (IM) is a patient-centered, evidence-based therapeutic paradigm that combines conventional and complementary approaches. The use of IM in pediatrics has increased over the past two decades, and parents’ demand for it is growing. Anthroposophic medicine is one IM whole-systems approach. Given the growing demand for integrative approaches in children, it is relevant from a public health perspective to find out which children use IM in Germany and whether they differ from pediatric inpatients in Germany as a whole. It is also of interest whether these patients are willing to travel longer distances to obtain integrative treatment.

Methods: The present study investigates the standard ward documentation datasets of 29,956 patients from all German integrative anthroposophic pediatric inpatient wards from 2005 to 2016 and compares them systematically with data collected for all pediatric inpatient wards in Germany. In addition to patients’ age and gender and the ICD-10 admission diagnoses, the geographical catchment areas of the hospitals were analyzed.

Results: Sociodemographic characteristics of pediatric inpatients in the integrative anthroposophic departments (IAH) did not differ from those of all pediatric inpatients. Regarding clinical characteristics, compared with all pediatric inpatients in Germany, higher frequencies were found for endocrine, nutritional, and metabolic diseases (IAH: 7.24% vs. 2.98%); mental, behavioral, and neurodevelopmental disorders (IAH: 9.83% vs. 3.78%); and nervous diseases (IAH: 8.82% vs. 5.16%). Lower frequencies were found for general pediatric diseases such as respiratory diseases (IAH: 17.06% vs. 19.83%), digestive diseases (IAH: 3.90% vs. 6.25%), and infectious and parasitic diseases (IAH: 12.88% vs. 14.82%). The IAH showed a broad catchment area, with most patients coming from the former western Federal Republic of Germany. Large catchment areas (> 100 km) for the IAH are accounted for only by severe and chronic diseases.

Conclusion: Pediatric inpatients of IAH do not differ from pediatric inpatients in Germany as a whole regarding sociodemographic characteristics but do differ regarding clinical characteristics. Parents are willing to travel longer distances to obtain specialized integrative anthroposophic medical care for children with severe and chronic diseases.

Articles published on Documentation Data Set

Construction of a Character Dataset for Historical Uchen Tibetan Documents under Low-Resource Conditions

GAE-Based Document Embedding Method for Clustering

A Framework for Service Semantic Description Based on Knowledge Graph

GRAPH BASED CLUSTERING WITH CONSTRAINTS AND ACTIVE LEARNING

Estimating Hydraulic Conductivity of Overconsolidated Soils Based on Piezocone Penetration Test (PCPT)

Document structure model for survey generation using neural network

MOWDOC: A Dataset of Documents From Taking the Measure of Work for Building a Latent Semantic Analysis Space.

Content analytics based on random forest classification technique: An empirical evaluation using online news dataset

Smoothness Regularized Multiview Subspace Clustering With Kernel Learning

Can Social Media Listening Platforms’ Artificial Intelligence Be Trusted? Examining the Accuracy of Crimson Hexagon’s (Now Brandwatch Consumer Research’s) AI-Driven Analyses

Improved Parsimonious Topic Modeling Based on the Bayesian Information Criterion.

Do patients of integrative anthroposophic pediatric inpatient departments differ? Comparative analysis to all pediatric inpatients in Germany considering demographic and clinical characteristics

CHOOSING SEEDS FOR SEMI-SUPERVISED GRAPH BASED CLUSTERING

Effects of voluntary crouch gait on lower limb muscle forces and joint reaction forces in healthy children: work in progress

A fuzzy Approach based for Document Datasets Clustering

Clustering of multi-view relational data based on particle swarm optimization

Examination of design for large and complex network projects

A novel somatic cancer gene-based biomedical document feature ranking and clustering model

A Software-driven Workflow for the Reuse of Language Documentation Data in Typological Studies

Accelerating Nonnegative Matrix Factorization Algorithms Using Extrapolation.
