How to visualize Word2Vec embeddings in Gensim?

作者

2023年08月22日

更新时间

10.94 分钟

阅读时间

阅读量

To visualize Word2Vec embeddings in Gensim, you can use the t-SNE algorithm to reduce the dimensionality of the embeddings, and then plot them using a scatter plot. Here is an example code snippet:

import gensim
from sklearn.manifold import TSNE
import matplotlib.pyplot as plt

# Load trained Word2Vec model
model = gensim.models.Word2Vec.load('path/to/model')

# Get vectors for a sample of words
sample_words = ['cat', 'dog', 'bird', 'horse', 'fish', 'snake']
vectors = [model.wv[word] for word in sample_words]

# Use t-SNE to reduce dimensionality to 2D
tsne = TSNE(n_components=2)
vectors_2d = tsne.fit_transform(vectors)

# Plot the words as points on a scatter plot
plt.scatter(vectors_2d[:, 0], vectors_2d[:, 1])
for i, word in enumerate(sample_words):
    plt.annotate(word, xy=(vectors_2d[i, 0], vectors_2d[i, 1]))
plt.show()

This code will plot a scatter plot of the selected words, where words that have similar contexts in the original dataset should be grouped together in the plot. You can adjust the size of the plot and other visualization parameters as desired.

How to visualize Word2Vec embeddings in Gensim?

相关标签

How to use Gensim for topic modeling with LDA?

How to load a text corpus into Gensim?

博客作者

GLM 是真敢删啊？！说好的 P0 安全规范呢？

如果要投票一个最弱智的ai模型一定是千问

告别手动拼接：PromptForge 如何重新定义你的 AI 工作流

Privacy Policy for TerryVoiceRead Chrome Extension

告别龟速！NAS迅雷内测体验，速度起飞，附邀请码！