I am having a ready to go word2vec model that I already trained. I have serialized it as a CSV file:
word, v0, v1, ..., vN
house, 0.1234, 0.4567, ..., 0.3461
car, 0.456, 0.677, ..., 0.3461
What I'd like to know is how I can load that word vector model in gensim
and use that to train a paragraph or doc2vec model.
This Doc2Vec tutorial says I can load a model in form of a "# C text format
" but I have no idea what that actually means. What is "C text format" in the first place but more important:
- How can I load my word2vec model and use it for doc2vec training?
How do I build the vocabulary from my word2vec model?