R语言 topicmodels包 ldaformat2dtm()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-10-1 11:03:29

ldaformat2dtm(topicmodels)
ldaformat2dtm()所属R语言包：topicmodels

                                    Transform data from and for use with the lda package
                                       变换的数据和使用与lda的包

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

Data from the lda package is transformed to a document-term matrix. This data format can be used to fit topic models using package topicmodels.
从lda包的数据被变换到一个文档的术语基质。这一数据格式中，可以使用适合主题的模型，使用套件topicmodels。

Data in form of a document-term matrix is transformed to the LDA format used by package lda.
数据包lda使用LDA格式的文件术语矩阵的形式转化为。

用法----------Usage----------

ldaformat2dtm(documents, vocab, omit_empty = TRUE)
dtm2ldaformat(x, omit_empty = TRUE)

参数----------Arguments----------

参数：documents
A list where each entry corresponds to a document; for each document the number of terms occurring in the document are stored in a matrix with two rows such that in each column the first entry corresponds to the vocabulary id of the term and the second entry to the number of times this term occurred in the document.
Alist，其中每个条目对应于一个文档，每个文档被存储在文档中出现的术语数目matrix与两排，使得在每列中的第一个条目对应的词汇的id项和第二项的次数发生这个术语在文档中。

参数：vocab
A "character" vector of the terms in the vocabulary.
A"character"向量的词汇中的条款。

参数：x
An object of class "DocumentTermMatrix" as defined in package tm.
类的一个对象"DocumentTermMatrix"在包tm的定义。“

参数：omit_empty
A logical indicating if empty documents should be removed when converting the objects. By default empty documents are removed.
一个逻辑，表示如果是空的文件被删除时，转换的对象。默认情况下，空文件将被删除。

值----------Value----------

An object of class "DocumentTermMatrix" is returned by ldaformat2dtm() and a list with components "documents" and "vocab" by dtm2ldaformat().
一个对象的类"DocumentTermMatrix"的返回的组件ldaformat2dtm()和"documents""vocab"dtm2ldaformat()和一个列表。

（作者）----------Author(s)----------

Bettina Gruen

实例----------Examples----------

if (require("lda")) {
  data("cora.documents", package = "lda")
  data("cora.vocab", package = "lda")
  dtm <- ldaformat2dtm(cora.documents, cora.vocab)
  cora <- dtm2ldaformat(dtm)
  all.equal(cora, list(documents = cora.documents,
                     vocab = cora.vocab))
}

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册