Latent Dirichlet Allocation (LDA) is a powerful learning algorithm for automatically clustering words and documents into topics [5]. One way to view a topic model is as a generalization of a mixture model like Dirichlet Process Clustering. In a mixture model, we use a probabilistic mixture of a number of models to explain some observed data. Each observed data point is assumed to have come from one of the models in the mixture, but we don't know which; a so-called latent variable specifies which model each data point came from. Starting from such a normal mixture model, in which we have a single global mixture of several distributions, a topic model instead says that each document has its own mixture distribution over the globally shared mixture components. Operationally, in Dirichlet Process Clustering each document has its own latent variable drawn from a global mixture that specifies which model it belongs to, while in LDA each word in each document has its own latent variable drawn from a document-wide mixture.
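To make the distinction concrete, the following is a minimal sketch of the LDA generative process described above. All sizes and hyperparameter values (K, V, D, N, alpha, beta) are hypothetical, chosen purely for illustration; it uses NumPy's random generator and is not the source's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (all hypothetical): K topics, a vocabulary of V
# words, and D documents of N words each.
K, V, D, N = 3, 10, 5, 8
alpha = 0.1   # Dirichlet prior on the per-document topic mixtures
beta = 0.01   # Dirichlet prior on the per-topic word distributions

# Global components: one word distribution per topic, shared by all
# documents.
phi = rng.dirichlet(np.full(V, beta), size=K)   # shape (K, V)

documents = []
for d in range(D):
    # Unlike a global mixture model, each document draws its OWN
    # mixture over the shared topics.
    theta = rng.dirichlet(np.full(K, alpha))    # shape (K,)
    words = []
    for n in range(N):
        # Each word gets its own latent topic assignment, drawn from
        # the document-wide mixture theta ...
        z = rng.choice(K, p=theta)
        # ... and the word itself is drawn from that topic's word
        # distribution.
        w = rng.choice(V, p=phi[z])
        words.append(w)
    documents.append(words)

print(documents)
```

In Dirichlet Process Clustering, by contrast, the latent draw would happen once per document rather than once per word: the entire document would be assigned to a single component from one global mixture.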