word2vec获取
时间: 2023-12-14 18:44:05 浏览: 31
词向量的过程是通过word2vec进行的。首先,你需要对语料库进行预处理,包括大小写转换、拼写错误检查和分词处理等操作。接下来,将处理过的语料库的one-hot向量作为word2vec的输入。word2vec会训练低维词向量(也称为词嵌入),将corpus的one-hot向量转换为低维词向量(模型的中间产物,更具体来说是输入权重矩阵)。word2vec有两种训练模型(CBOW和Skip-gram)和两种加速算法(Negative Sample与Hierarchical Softmax)。这里只讨论了将corpus的one-hot向量转换为低维词向量的过程。<span class="em">1</span><span class="em">2</span><span class="em">3</span>
#### 引用[.reference_title]
- *1* *3* [word2vec是如何得到词向量的](https://blog.csdn.net/weixin_38569817/article/details/79299030)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v92^chatsearchT0_1"}}] [.reference_item style="max-width: 50%"]
- *2* [word2vec 获得当前的所有词向量表的几种方式](https://blog.csdn.net/lhmood/article/details/89710642)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v92^chatsearchT0_1"}}] [.reference_item style="max-width: 50%"]
[ .reference_list ]