Clustering is one of the most important unsupervised learning problems in machine learning. As one of the most widely used clustering algorithms, K-means has been studied extensively, and many more sophisticated clustering algorithms build on it. Moreover, K-means is often used as the final clustering step in methods such as subspace clustering and nonnegative matrix factorization. However, for high-dimensional data, these algorithms generally use all features of the data, which often degrades clustering performance because redundant and noisy features are included. Existing research has demonstrated the importance of learning patterns from meaningful features, which motivates us to discover useful features simultaneously within the K-means framework. Thus, in this article, we incorporate feature selection into the K-means framework. To further enhance clustering ability, we minimize the fitting residual with a sparse norm and exploit the representation of the data on a manifold, which improves robustness to outliers, missing values, and noise, and strengthens the ability to recover nonlinear structures in the data. We conducted extensive experiments on gene expression and face image data sets to verify the effectiveness of the proposed method. In particular, we compare its clustering performance with several state-of-the-art algorithms on both original and noisy data, and we analyze the convergence, parameter sensitivity, learned features, and computational time of the proposed method. Across these experiments, the proposed method consistently outperforms the baseline methods, confirming its effectiveness.
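As a point of reference, the following is a minimal sketch of standard (Lloyd's) K-means, the baseline framework the article extends. It is not the proposed method: the feature selection, sparse-norm residual, and manifold terms described above are not included, and all names and parameters here are illustrative.

```python
import numpy as np

def kmeans(X, k, n_iters=100, seed=0):
    """Plain Lloyd's K-means: alternate nearest-centroid assignment
    and centroid recomputation until the centroids stop moving."""
    rng = np.random.default_rng(seed)
    # Initialize centroids from k distinct random data points.
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iters):
        # Assign each point to its nearest centroid (squared Euclidean distance).
        dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Recompute each centroid as the mean of its assigned points;
        # keep the old centroid if a cluster becomes empty.
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids

# Toy example: two well-separated 2-D blobs.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 0.1, (20, 2)), rng.normal(5.0, 0.1, (20, 2))])
labels, centroids = kmeans(X, k=2)
```

Because every feature enters the distance computation with equal weight, noisy or redundant dimensions can dominate the assignments in high-dimensional data, which is exactly the shortcoming the proposed method targets.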