最新 'gensim' 问题

关联标签

10得票6回答

如何在gensim中仅访问主题词

我用Gensim构建了LDA模型，现在只想获取主题词，不需要概率和ID。我尝试过gensim中的print_topics()和show_topics()函数，但是无法获得干净的单词。以下是我使用的代码：我使用了以下代码：dictionary = corpora.Dictionary(do...

pythonnlpgensimldatopic-modeling

15得票1回答

Gensim Word2vec：语义相似性

我想了解gensim word2vec的两种相似度测量方法most_similar()和most_similar_cosmul()之间的区别。我知道第一种方法使用词向量的余弦相似度，而另一种方法使用Omer Levy和Yoav Goldberg提出的乘法组合目标。我想知道它们对结果有什么影响？...

pythonsemanticssimilaritygensimword2vec

9得票1回答

如何在gensim LDA中获取所有文档的document_topics分布？

我是Python的新手，需要构建一个LDA项目。在进行了一些预处理步骤后，这是我的代码： dictionary = Dictionary(docs) corpus = [dictionary.doc2bow(doc) for doc in docs] from gensim.models ...

python-3.xgensimldatopic-modelingprobability-distribution

7得票1回答

gensim word2vec的训练时间

我正在使用34 GB预处理的MS_MARCO语料库(22 GB)从头开始训练word2vec。(预处理语料库经过sentencepiece分词，因此其大小更大) 我正在使用以下代码训练我的word2vec模型： from gensim.test.utils import common_tex...

pythonnlpgensimword2vec

8得票1回答

使用Gensim获取文档的主题（已查看的文档）

我知道在gensim中对lda模型进行训练后，我们可以通过以下方式获取未见过的文档主题： lda = LdaModel(corpus, num_topics=10) doc_lda = lda[doc_bow] 那么，已经用于训练的文档怎么办呢？我的意思是，在不将其视为新文档的情况下，有...

pythonldagensim

7得票2回答

通过Gensim找到未见过的文档主题

我正在使用Gensim进行大规模主题建模。我不太明白如何确定非索引文档的预测主题。例如：我有2500万个文档已经在LSA（和LDA）空间中转换为向量。现在，我想要找出一个新文档的主题，我们称其为x。根据Gensim的文档，我可以使用以下方法： topics = lsi[doc(x)] ...

pythonnlplatent-semantic-indexinggensim

14得票5回答

Python node2vec（Gensim Word2Vec）"Process finished with exit code 134 (interrupted by signal 6: SIGABRT)" （注：该内容为提问标题，无需回答）

我正在使用Python中的node2vec，它在内部使用Gensim的Word2Vec。当我使用小数据集时，代码运行良好。但是，一旦我尝试在大数据集上运行相同的代码，代码就会崩溃: 错误：进程以退出码134结束(被信号6:SIGABRT中断)。导致错误的行是：model = Word2...

pythonpycharmword2vecgensim

8得票2回答

Gensim LDA 相关性得分为Nan。

我按照以下教程创建了一个Gensim LDA模型：https://www.machinelearningplus.com/nlp/topic-modeling-gensim-python/ lda_model = gensim.models.LdaMulticore(data_df['bow...

pythonmachine-learninggensimldatopic-modeling

29得票5回答

使用gensim理解LDA实现

我正在努力理解Python中使用gensim软件包实现潜在狄利克雷分配的过程。我正在按照以下步骤进行：定义数据集documents = ["Apple is releasing a new product", "Amazon sells many things"...

pythongensimldatopic-modelingdirichlet

18得票6回答

如何在gensim中完全从Word2Vec模型中删除一个单词？

给定一个模型，例如。from gensim.models.word2vec import Word2Vec documents = ["Human machine interface for lab abc computer applications", "A survey of user...

pythondictionaryword2vecgensimdel