Average Linkage Algorithm Research Articles

• A Novel ensemble/cooperative framework based on concept-based and clustering is proposed to perform Twitter sentiment Analysis. • It employs majority voting, tie breaker criteria, and linguistic rules in concept-based module. • Comparative analysis between clustering and classification is presented when integrated with concept based methods. • It presents the performance of feature representation methods (Boolean and TF-IDF). • Experimental results on Twitter Datasets revealed better performance of proposed framework. Concept-based sentiment analysis (CBSA) methods have gained prominence in natural language processing in recent years. These methods consider the underlying semantic meanings of text to perform different tasks such as Twitter sentiment analysis (assigning positive, negative, or neutral sentiment to Tweets). CBSA is superior to traditional statistical methods for accurately discovering sentiment labels. Due to a limited knowledge base, these methods are unable to identify the sentiment polarity of all kinds of text. Therefore, supervised learning techniques are mostly ensembled with CBSA methods to classify the whole text. These techniques require labeled data. It is a tedious and time-consuming task due to the manually labeling of large datasets (Such as Twitter datasets). Therefore, an unsupervised learning mechanism can be a better alternative to solve this problem. In this paper, a novel unsupervised learning framework based on Concept-based and hierarchical clustering is proposed for Twitter sentiment analysis. Popular hierarchical clustering methods including single linkage, complete linkage, and average linkage algorithms are ensembled serially. Two different feature representation methods including Boolean and Term frequency-inverse document frequency (TF-IDF) are investigated. We have also experimented with Well-known classifiers (Naive Bayes, Neural Network) for a fair comparison. Accuracy measure (proportion of correct predictions) is used to evaluate the performance of understudied techniques. It is empirically shown that the performance of unsupervised learning techniques is comparable with supervised learning techniques.

Next-generation sequencing platforms are routinely used for molecular assignment due to their high impact for risk stratification and prognosis in medulloblastomas. Yet, low and middle-income countries still lack an accurate cost-effective platform to perform this allocation. TaqMan Low Density array (TLDA) assay was performed using a set of 20 genes in 92 medulloblastoma samples. The same methodology was assessed in silico using microarray data for 763 medulloblastoma samples from the GSE85217 study, which performed MB classification by a robust integrative method (Transcriptional, Methylation and cytogenetic profile). Furthermore, we validated in 11 MBs samples our proposed method by Methylation Array 450 K to assess methylation profile along with 390 MB samples (GSE109381) and copy number variations. TLDA with only 20 genes accurately assigned MB samples into WNT, SHH, Group 3 and Group 4 using Pearson distance with the average-linkage algorithm and showed concordance with molecular assignment provided by Methylation Array 450 k. Similarly, we tested this simplified set of gene signatures in 763 MB samples and we were able to recapitulate molecular assignment with an accuracy of 99.1% (SHH), 94.29% (WNT), 92.36% (Group 3) and 95.40% (Group 4), against 97.31, 97.14, 88.89 and 97.24% (respectively) with the Ward.D2 algorithm. t-SNE analysis revealed a high level of concordance (k = 4) with minor overlapping features between Group 3 and Group 4. Finally, we condensed the number of genes to 6 without significantly losing accuracy in classifying samples into SHH, WNT and non-SHH/non-WNT subgroups. Additionally, we found a relatively high frequency of WNT subgroup in our cohort, which requires further epidemiological studies. TLDA is a rapid, simple and cost-effective assay for classifying MB in low/middle income countries. A simplified method using six genes and restricting the final stratification into SHH, WNT and non-SHH/non-WNT appears to be a very interesting approach for rapid clinical decision-making.

Average Linkage Algorithm Research Articles

Related Topics

Articles published on Average Linkage Algorithm

A novel unsupervised ensemble framework using concept-based linguistic methods and machine learning for twitter sentiment analysis

Coal elemental (compositional) data analysis with hierarchical clustering algorithms

Pengelompokkan Tingkat Kriminalitas di Indonesia Menggunakan Algoritma Average Linkage

Short Communication: Leaf architectural analysis of confusing Syzygium species: Syzygium aqueum (Burm.f.) Alston and Syzygium samarangense (Blume) Merr. & L.M.Perry (Myrtaceae)

Efficient text feature extraction by integrating the average linkage and K-medoids clustering

A clustering of red wines based on physicochemical and optical properties

Car shape clustering using sobel edge detection with divisive average linkage and single linkage algorithm (case: bus, sedan, citycar, mpv, and truck)

Assessment of ISSR Markers for Tagging Genetic Variability for Yield Components in Small Cardamom (Elettaria cardamomum Maton)

Identification Bacterial Contaminant in Semiarundinaria fastuosa Tissue Culture

A simplified approach using Taqman low-density array for medulloblastoma subgrouping

Analysis of precipitation data in Bangladesh through hierarchical clustering and multidimensional scaling

The Expansion of Initial Point Algorithm for K-Modes Algorithm

Enhancing point symmetry-based distance for data clustering

Using Dendritic Heat Maps to Simultaneously Display Genotype Divergence with Phenotype Divergence.

A comparison of bacterial communities in feces of breastfed infants determined by 454 pyrosequencing and Illumina MiSeq (1017.5)

Evaluación de la diversidad genética en genotipos de Brassica juncea (Brassicaceae) utilizando diferencias fenotípicas y marcadores SSR

Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain

Normalised monthly shortage curves: a contribution for a better understanding of monthly rain deficit in Western Europe

Exploring the conformational changes of the ATP binding site of gyrase B from Escherichia coli complexed with different established inhibitors by using molecular dynamics simulation : Protein–ligand interactions in the light of the alanine scanning and free energy decomposition methods

Abscisic acid and sucrose increase the protein content in date palm somatic embryos, causing changes in 2-DE profile

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Average Linkage Algorithm Research Articles

Related Topics

Articles published on Average Linkage Algorithm

A novel unsupervised ensemble framework using concept-based linguistic methods and machine learning for twitter sentiment analysis

Coal elemental (compositional) data analysis with hierarchical clustering algorithms

Pengelompokkan Tingkat Kriminalitas di Indonesia Menggunakan Algoritma Average Linkage

Short Communication: Leaf architectural analysis of confusing Syzygium species: Syzygium aqueum (Burm.f.) Alston and Syzygium samarangense (Blume) Merr. &amp; L.M.Perry (Myrtaceae)

Efficient text feature extraction by integrating the average linkage and K-medoids clustering

A clustering of red wines based on physicochemical and optical properties

Car shape clustering using sobel edge detection with divisive average linkage and single linkage algorithm (case: bus, sedan, citycar, mpv, and truck)

Assessment of ISSR Markers for Tagging Genetic Variability for Yield Components in Small Cardamom (Elettaria cardamomum Maton)

Identification Bacterial Contaminant in Semiarundinaria fastuosa Tissue Culture

A simplified approach using Taqman low-density array for medulloblastoma subgrouping

Analysis of precipitation data in Bangladesh through hierarchical clustering and multidimensional scaling

The Expansion of Initial Point Algorithm for K-Modes Algorithm

Enhancing point symmetry-based distance for data clustering

Using Dendritic Heat Maps to Simultaneously Display Genotype Divergence with Phenotype Divergence.

A comparison of bacterial communities in feces of breastfed infants determined by 454 pyrosequencing and Illumina MiSeq (1017.5)

Evaluación de la diversidad genética en genotipos de Brassica juncea (Brassicaceae) utilizando diferencias fenotípicas y marcadores SSR

Principal component and clustering analysis on molecular dynamics data of the ribosomal L11·23S subdomain

Normalised monthly shortage curves: a contribution for a better understanding of monthly rain deficit in Western Europe

Exploring the conformational changes of the ATP binding site of gyrase B from Escherichia coli complexed with different established inhibitors by using molecular dynamics simulation : Protein–ligand interactions in the light of the alanine scanning and free energy decomposition methods

Abscisic acid and sucrose increase the protein content in date palm somatic embryos, causing changes in 2-DE profile

Short Communication: Leaf architectural analysis of confusing Syzygium species: Syzygium aqueum (Burm.f.) Alston and Syzygium samarangense (Blume) Merr. & L.M.Perry (Myrtaceae)